The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 419 |
| Number of variables | 4 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
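
The character-variable checks listed above can be approximated in base R. The sketch below is not dataMaid's implementation: the vector `x`, the three helper functions, and the list of suspected missing codes are hypothetical and chosen only for illustration.

```r
# Toy character vector (hypothetical values, not taken from the data).
x <- c("Estructura", "ESTRUCTURA", " Pintura", "Pintura", "", "99", "Acabados")

# Values with leading or trailing whitespace.
has_whitespace <- function(v) unique(v[v != trimws(v)])

# Values that differ from another value only by letter case.
has_case_issue <- function(v) {
  u <- unique(v)
  key <- tolower(u)
  u[key %in% key[duplicated(key)]]
}

# Values that look like placeholder codes for missing data (assumed code list).
suspect_missing <- function(v, codes = c("", ".", "8", "9", "88", "99", "999")) {
  intersect(codes, unique(v))
}

has_whitespace(x)   # " Pintura"
has_case_issue(x)   # "Estructura" "ESTRUCTURA"
suspect_missing(x)  # ""   "99"
```

A flagged value is only a candidate problem: a code such as "99" may be a legitimate identifier in some columns.
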
Please note that all numerical values in the following have been rounded to 2 decimals.

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdActividad | numeric | 419 | 0.00 % | × |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Descripcion | character | 282 | 0.00 % | × |
| Orden | integer | 210 | 0.00 % | |

Variable: SkIdActividad

| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 419 |
| Median | 1004355731714487 |
| 1st and 3rd quartiles | 1002033041407545; 1006724442059458 |
| Min. and max. | 1004758460013; 1008861420467332 |
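
The key above is reported as `numeric` rather than `integer`, presumably because its values exceed R's 32-bit integer range. The following sketch uses only the minimum and maximum printed in the table to check that such identifiers are still represented exactly as doubles; the variable names are hypothetical.

```r
id_min <- 1004758460013
id_max <- 1008861420467332

# R integers are 32-bit, so these identifiers cannot be stored as integer.
id_max > .Machine$integer.max   # TRUE

# Doubles represent every whole number exactly up to 2^53.
id_max < 2^53                   # TRUE
id_max == floor(id_max)         # TRUE: no fractional part
```

Identifiers this large are safer to compare and join on when kept as character or as whole-number doubles below 2^53.
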

Variable: Descripcion

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 282 |
| Mode | “Concreto pobre afinado” |

Variable: Orden

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 210 |
| Median | 104 |
| 1st and 3rd quartiles | 52; 156.5 |
| Min. and max. | 0; 209 |

Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:16:10
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64 (America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
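
The call above relies on the objects `df` and `nombre`, which are defined outside this report. One way such a call could be driven over several tables is sketched below. The list `tables`, its contents, and the helpers `report_path` and `generate_reports` are hypothetical; only the `makeDataReport` arguments are taken from the call shown in the report.

```r
# Hypothetical named list of data frames; the real tables are not part of this report.
tables <- list(
  actividad = data.frame(SkIdActividad = c(1, 2), Orden = c(0L, 1L)),
  bodega = data.frame(SkIdBodega = 1:3, Descripcion = c("A", "B", "C"))
)

# Build the output path the same way as the call in the report.
report_path <- function(nombre, folder = "20251102") {
  paste0(folder, "/reporte_", nombre, ".html")
}

paths <- vapply(names(tables), report_path, character(1))

# Write one report per table; stops early if dataMaid is not installed.
generate_reports <- function(tables, paths) {
  if (!requireNamespace("dataMaid", quietly = TRUE)) {
    stop("Package 'dataMaid' is required.")
  }
  for (nombre in names(tables)) {
    dir.create(dirname(paths[[nombre]]), showWarnings = FALSE, recursive = TRUE)
    dataMaid::makeDataReport(data = tables[[nombre]], output = "html",
                             file = paths[[nombre]],
                             replace = TRUE, openResult = FALSE)
  }
  invisible(paths)
}

# generate_reports(tables, paths)  # uncomment to write the HTML files
```

The sketch spells out `TRUE` and `FALSE` because `T` and `F` are ordinary variables in R and can be reassigned.
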
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 92153 |
| Number of variables | 4 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdBodega | integer | 193 | 0.00 % | × |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| CodigoBodega | integer | 1 | 0.00 % | × |
| Descripcion | character | 3 | 0.00 % | |

Variable: SkIdBodega

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 193 |
| Median | 1000165 |
| 1st and 3rd quartiles | 1000118; 1000224 |
| Min. and max. | 10003; 1000295 |

Variable: Descripcion

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “Bodega Principal” |

Report creation time: Sun Nov 02 2025 14:16:13

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 2007 |
| Number of variables | 8 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdCapitulo | integer | 2007 | 0.00 % | × |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Codigo.Proyecto | integer | 88 | 0.00 % | × |
| Capitulo.Numero | character | 80 | 0.00 % | × |
| Capitulo.Descripcion | character | 192 | 0.00 % | × |
| Tipo.Costo | character | 4 | 0.00 % | × |
| Tipo.Costo.Orden | integer | 4 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |

Variable: SkIdCapitulo

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2007 |
| Median | 1002042714 |
| 1st and 3rd quartiles | 1001161221.5; 1002423557.5 |
| Min. and max. | 100346; 1002954543 |

Variable: Codigo.Proyecto

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 88 |
| Median | 204 |
| 1st and 3rd quartiles | 116; 242 |
| Min. and max. | 3; 295 |

Variable: Capitulo.Numero

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 80 |
| Mode | “37” |
The following suspected missing value codes enter as regular values: "8", "9".
Note that the following levels have at most five observations: "01", "01 00 00", "02", "02 00 00", "03 00 00", …, "50", "55", "60", "60 00 00", "CI" (33 values omitted).

Variable: Capitulo.Descripcion

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 192 |
| Mode | “IMPREVISTOS” |
Note that the following levels have at most five observations: "ABERTURAS Y FACHADAS", "ACABADOS", "ACABADOS DE PISO", "ACABADOS EN MUROS", "ACERO DE REFUERZO", …, "URBANISMOS", "UTILIDAD", "VENTANERIA Y FACHADAS", "VENTANERIAS", "VIGILANCIA" (135 values omitted).
Note that there might be case problems with the following levels: "Equipos y herramientas", "EQUIPOS Y HERRAMIENTAS", "Estructura", "ESTRUCTURA", "Pañetes", "PAÑETES", "Pintura", "PINTURA", "Preliminares", "PRELIMINARES".
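
The pairs flagged above differ only in letter case. As a minimal sketch, the snippet below shows how upper-casing would collapse the unaccented levels quoted in the note. Treating upper case as the canonical form is an assumption, and the variable names are hypothetical. Accented levels such as "Pañetes" are left out because the handling of non-ASCII letters by `toupper` can depend on the session locale.

```r
# Levels quoted in the case-issue note (ASCII-only subset).
levels_quoted <- c("Equipos y herramientas", "EQUIPOS Y HERRAMIENTAS",
                   "Estructura", "ESTRUCTURA",
                   "Pintura", "PINTURA",
                   "Preliminares", "PRELIMINARES")

canonical <- toupper(levels_quoted)

length(unique(levels_quoted))  # 8
length(unique(canonical))      # 4

# Group the original spellings under their canonical form for review.
split(levels_quoted, canonical)
```
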

Variable: Tipo.Costo

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “COSTOS DIRECTOS” |

Variable: Tipo.Costo.Orden

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “1” |
| Reference category | 0 |

Report creation time: Sun Nov 02 2025 14:16:16

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 36 |
| Number of variables | 5 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdClaseOrigen | integer | 36 | 0.00 % | |
| Clase | character | 7 | 0.00 % | × |
| Clase.Descripcion | character | 6 | 0.00 % | × |
| Origen | character | 21 | 0.00 % | × |
| Origen.Descripcion | character | 32 | 0.00 % | × |

Variable: SkIdClaseOrigen

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 36 |
| Median | 18.5 |
| 1st and 3rd quartiles | 9.75; 28.25 |
| Min. and max. | 1; 38 |

Variable: Clase

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Mode | “I” |

Variable: Clase.Descripcion

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6 |
| Mode | “Invertido” |

Variable: Origen

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 21 |
| Mode | “C” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "", "C", "D", "E", "ED", …, "TE", "TS", "V", "X", "Y" (11 values omitted).

Variable: Origen.Descripcion

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 32 |
| Mode | “Cuentas Control” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "", "Actas Descuento Menor Valor", "Actas Generales", "Actas Por Grupos", "Actas Todo Costo", …, "Transformacion Entradas", "Transformacion Salidas", "Traslados Entradas", "Traslados Salidas", "Valores Comprados" (22 values omitted).
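
Empty strings are not `NA` in R, which is why the tables above report 0 missing observations while the notes flag "" as a suspected missing code. The sketch below, on hypothetical values, shows one way to recode blank entries so that they are counted as missing.

```r
origen <- c("C", "", "D", "E", "")   # hypothetical values
sum(is.na(origen))                   # 0: empty strings are not NA

# Recode empty or whitespace-only strings as missing.
origen_clean <- origen
origen_clean[trimws(origen_clean) == ""] <- NA_character_
sum(is.na(origen_clean))             # 2
```

Whether a blank entry means "unknown" or "not applicable" in these tables is a question for the data owner; the recoding only makes the blanks visible to missing-value summaries.
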

Report creation time: Sun Nov 02 2025 14:16:20

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 1 |
| Number of variables | 6 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| NombreEmpresa | character | 1 | 0.00 % | × |
| Nit | integer | 1 | 0.00 % | × |
| Direccion | character | 1 | 0.00 % | × |
| Ref_IdEmpresa | integer | 1 | 0.00 % | × |
| Ref_BdConfServidor | integer | 1 | 0.00 % | × |
The variable is a key (distinct values for each observation).
The variable only takes one (non-missing) value: "ARPRO ARQUITECTOS INGENIEROS S.A.S". The variable contains 0 % missing observations.
The variable is a key (distinct values for each observation).
The variable only takes one (non-missing) value: "CRA 19 No 90-10". The variable contains 0 % missing observations.

Report creation time: Sun Nov 02 2025 14:16:23

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 111611 |
| Number of variables | 5 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdPedido | integer | 111611 | 0.00 % | × |
| Codigo.Orden.De.Compra | numeric | 23817 | 20.49 % | × |
| Pedido.Urgente | character | 2 | 0.00 % | |
| Tipo.Pedido | character | 2 | 0.00 % | |

Variable: SkIdPedido

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 111611 |
| Median | 10080194 |
| 1st and 3rd quartiles | 10035733.5; 100111863.5 |
| Min. and max. | 100108; 100141328 |

Variable: Codigo.Orden.De.Compra

| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 22867 (20.49 %) |
| Number of unique values | 23816 |
| Median | 16700167.5 |
| 1st and 3rd quartiles | 350482.75; 22500065.25 |
| Min. and max. | 30083; 29500001 |
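
The missing-value percentage above follows directly from the counts in the report, as the short check below shows. The summary table lists 23817 unique values while the detail table lists 23816. One plausible explanation, not confirmed by the report, is that one of the two counts treats `NA` as a value; the second half of the sketch illustrates that effect on a hypothetical vector.

```r
n_obs <- 111611
n_missing <- 22867
pct_missing <- round(100 * n_missing / n_obs, 2)
pct_missing                    # 20.49

# Hypothetical vector: counting unique values with and without NA.
x <- c(10, 20, 20, NA)
length(unique(x))              # 3 (NA counted as a value)
length(unique(x[!is.na(x)]))   # 2 (NA excluded)
```
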

Variable: Pedido.Urgente

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “NO” |

Variable: Tipo.Pedido

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “ADICIONAL” |

Report creation time: Sun Nov 02 2025 14:16:26

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 55896 |
| Number of variables | 6 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdEspecificacionActas | numeric | 55896 | 0.00 % | × |
| No.Acta | integer | 385 | 0.00 % | × |
| No.Contrato | integer | 11407 | 0.00 % | × |
| No.Factura | character | 43507 | 0.00 % | × |
| Codigo.de.barras | integer | 55896 | 0.00 % | |

Variable: SkIdEspecificacionActas

| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 55896 |
| Median | 10021021002376.5 |
| 1st and 3rd quartiles | 10010810801223.8; 10027627600024.2 |
| Min. and max. | 1003300011; 1002492490076239 |

Variable: No.Acta

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 385 |
| Median | 4 |
| 1st and 3rd quartiles | 2; 11 |
| Min. and max. | 1; 385 |

Variable: No.Contrato

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11407 |
| Median | 2000297.5 |
| 1st and 3rd quartiles | 1080095; 2280421 |
| Min. and max. | 30001; 2940002 |

Variable: No.Factura

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 1 (0 %) |
| Number of unique values | 43506 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "", "8", "88", "888", "9", "99", "999".
The following values appear with prefixed or suffixed white space: "112873035 ", "21 ", "63919687 ", "71682466 ", "A2431 ", …, "RO382 ", "RO560 ", "TRAY42 ", "VSFE1437 ", "VSFE740 " (27 values omitted).
Note that the following levels have at most five observations: " 01103", " 0122458", " 10209", " 105412587", " 105415234", …, "ZA82", "ZA84", "ZA9", "ZC1740", "ZC2204" (43195 values omitted).
Note that there might be case problems with the following levels: "Ajuste", "AJUSTE", "anulada", "ANULADA", "anulado", …, "fe114", "FE114", "no pago", "No Pago", "NO PAGO" (15 values omitted).
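
Trailing spaces like those flagged above can be stripped with base R's `trimws`. The sketch below applies it to a few of the values quoted in the whitespace note; the variable names are hypothetical.

```r
# A few values quoted in the whitespace note.
invoices <- c("112873035 ", "21 ", "A2431 ", "RO382 ", "VSFE740 ")

trimmed <- trimws(invoices)        # strips leading and trailing whitespace
sum(invoices != trimmed)           # 5: every value changed
any(grepl("^\\s|\\s$", trimmed))   # FALSE: no surrounding whitespace left
```

Trimming can merge values that were previously counted as distinct (for instance "21 " and "21"), so the number of unique values may drop after cleaning.
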

Variable: Codigo.de.barras

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 55896 |
| Median | 49050 |
| 1st and 3rd quartiles | 20845.25; 65503.25 |
| Min. and max. | 49; 86400 |

Report creation time: Sun Nov 02 2025 14:16:32

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 11950 |
| Number of variables | 10 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdContrato | numeric | 11950 | 0.00 % | |
| No..Contrato | integer | 11950 | 0.00 % | × |
| Descripcion | character | 9769 | 0.00 % | × |
| Formas.de.pago | character | 1169 | 0.00 % | × |
| Clase.Contrato | character | 3 | 0.00 % | × |
| Fecha.de.creacion | character | 3452 | 0.00 % | × |
| Usuario.de.creacion | character | 97 | 0.00 % | × |
| Fecha.Inicio | character | 3475 | 0.00 % | × |
| Fecha.Fin | character | 3001 | 0.00 % | × |

Variable: SkIdContrato

| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11950 |
| Median | 1001881880267.5 |
| 1st and 3rd quartiles | 10035350308.25; 1002282280720.75 |
| Min. and max. | 100330001; 1002952950003 |

Variable: No..Contrato

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11950 |
| Median | 1880267.5 |
| 1st and 3rd quartiles | 350308.25; 2280720.75 |
| Min. and max. | 30001; 2950003 |

Variable: Descripcion

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 9769 |
| Mode | “ACARREOS URBANOS” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: " ALQUILER DE BAÑO PORTATIL", " Alquiler de equipo etapa 2", " Alquiler Retroexcavadora E42 para construcción de plataformas de trabajo ", " Comisión de topografía", " CONSOLIDACION Y MAMPOSTERIA PAÑETES Y VANOS ANEXIDADES – CLAUSTRO – TEMPLO - CLUB", …, "VISITA TECNICA TRANSFORMADORES ", "Visitas de geotecnia ", "Visitas de Geotecnista ", "Visitas de topografía ", "volante, pasa calles, eventos " (1713 values omitted).
Note that the following levels have at most five observations: "- Actualización de la cimentación de la Torre A (Interior 11) de acuerdo al levantamiento topográfico elaborado por la obra.\n- Diseño de tanque de agu", " ALQUILER DE BAÑO PORTATIL", " Alquiler de equipo etapa 2", " Alquiler Retroexcavadora E42 para construcción de plataformas de trabajo ", " Comisión de topografía", …, "VISITAS TECNICAS (ASESORIA ESTRUCTURAL)", "VOLADURAS CONTROLADAS CIMENTACION EXISTENTE", "volante, pasa calles, eventos ", "Volquetas retiro de material sobrante (escombro)", "Workstation Preci 3581 Int Ci7 13700h 16g/1ts W11" (9676 values omitted).
Note that there might be case problems with the following levels: "Acarreos obra ", "ACARREOS OBRA ", "Acarreos urbanos", "Acarreos Urbanos", "ACARREOS URBANOS", …, "Transporte y disposición de residuos", "TRANSPORTE Y DISPOSICIÓN DE RESIDUOS", "Transportes urbanos", "Transportes Urbanos", "TRANSPORTES URBANOS" (567 values omitted).

Variable: Formas.de.pago

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1169 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: " 50% material en obra, saldo 50% ontra entrega", " actas parciales de acuerdo ala avance de obra", " actas segun avance de obra", " anticipo y actas catorcenal", " Cortes de Obra ", …, "un solo corte ", "UN SOLO CORTE ", "Una sola vez ", "UNICA VEZ ", "unico pago " (174 values omitted).
Note that the following levels have at most five observations: " 50% material en obra, saldo 50% ontra entrega", " actas parciales de acuerdo ala avance de obra", " actas segun avance de obra", " anticipo y actas catorcenal", " Cortes de Obra ", …, "unico Pago", "Unico pago", "UNICO PAGO", "unico pago ", "UNICO POR CONTRATO" (1022 values omitted).
Note that there might be case problems with the following levels: "10% anticipo; cortes quincenales", "10% Anticipo; cortes quincenales", "100% anticipado", "100% Anticipado", "100% contraentrega", …, "unico pago", "unico Pago", "Unico pago", "Unico Pago", "UNICO PAGO" (377 values omitted).

Variable: Clase.Contrato

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “GENERALES” |

Variable: Fecha.de.creacion

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3452 |
| Mode | “17/03/2015” |

Variable: Usuario.de.creacion

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 97 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: "Ana Maria Leon ", "Juan Felipe Murillo ", "Maria Mercedes Arias ", "William Alfredo Fernandez Leon ".
Note that the following levels have at most five observations: "Agustin Bolivar", "Andrés Camilo Montañez", "Cesar David Sotaquira", "Daniel Alejandro Viana", "Edgar Joaquín Ríos", …, "Maria Angelica Oliva", "Mauricio Lemus", "Miguel Matamala", "Natalia Moreno", "Tania Alejandra Acevedo Barriga" (8 values omitted).

Variable: Fecha.Inicio

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3475 |
| Mode | “01/10/2024” |

Variable: Fecha.Fin

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3001 |
| Mode | “31/12/2024” |
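
The three date columns in this table are stored as character, so no date checks are applied to them. The modes shown (for example "17/03/2015" and "31/12/2024") are consistent with a day/month/year layout. Assuming every value follows that layout, a conversion could look like the sketch below; the vector holds only the three modes from the report, and the variable names are hypothetical.

```r
fechas <- c("17/03/2015", "01/10/2024", "31/12/2024")

parsed <- as.Date(fechas, format = "%d/%m/%Y")
class(parsed)               # "Date"
format(parsed, "%Y-%m-%d")  # "2015-03-17" "2024-10-01" "2024-12-31"

# Values that do not match the layout become NA and are worth inspecting.
fechas[is.na(parsed)]       # character(0)
```
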

Report creation time: Sun Nov 02 2025 14:19:52

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 63318 |
| Number of variables | 6 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdEspecificacionEntradasAlmacen | numeric | 63302 | 0.00 % | × |
| No.Entrada | integer | 63302 | 0.00 % | |
| Remision | character | 49501 | 0.00 % | × |
| No.Factura | character | 48192 | 0.00 % | × |
| Codigo.de.barras | integer | 63318 | 0.00 % | × |

Variable: SkIdEspecificacionEntradasAlmacen

| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 63302 |
| Median | 1002282280270.5 |
| 1st and 3rd quartiles | 1001081080355.25; 10021721700443.8 |
| Min. and max. | 100330001; 10029429400005 |

Variable: No.Entrada

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 63302 |
| Median | 2280270.5 |
| 1st and 3rd quartiles | 1080355.25; 21700443.75 |
| Min. and max. | 30001; 29400005 |

Variable: Remision

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 49501 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "", ".", ".215141 ", ".30265", ".4024", "8", "88", "9", "99", "9999".
The following values appear with prefixed or suffixed white space: " 209164", ".215141 ", "43-00003022 ".
Note that the following levels have at most five observations: " 209164", ".", ".215141 ", ".30265", ".4024", …, "WHS 151728", "WHS 152294", "WHS141391", "WHS142622", "WHS144190" (49390 values omitted).

Variable: No.Factura

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 48192 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "", "88", "99".
The following values appear with prefixed or suffixed white space: " 1012116280", " CRE 23870", " F290-00117092", " F33070", " F33537", …, "FACT148173 ", "PR259-17 ", "PR281-17 ", "RI2399839 ", "RI29348 " (14 values omitted).
Note that the following levels have at most five observations: " 1012116280", " CRE 23870", " F290-00117092", " F33070", " F33537", …, "X2621041243", "X2651025554", "X2742507339", "X2861025569", "YBE887875" (47741 values omitted).
Note that there might be case problems with the following levels: "22v218042", "22V218042", "f333", "F333", "f5602", …, "FIG083470", "pf46794", "PF46794", "Toc15884", "TOC15884" (8 values omitted).

Variable: Codigo.de.barras

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 63318 |
| Median | 142240.5 |
| 1st and 3rd quartiles | 119185.5; 158829.75 |
| Min. and max. | 118; 175079 |

Report creation time: Sun Nov 02 2025 14:20:17

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 29 |
| Number of variables | 4 |

| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdEspecificacionEjecucionCliente | numeric | 29 | 0.00 % | × |
| NoActaCliente | integer | 26 | 0.00 % | × |
| ContratoCliente | character | 2 | 0.00 % | × |

Variable: SkIdEspecificacionEjecucionCliente

| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 29 |
| Median | 10018812 |
| 1st and 3rd quartiles | 1001885; 10018819 |
| Min. and max. | 10061; 10027527500003 |

Variable: NoActaCliente

| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 26 |
| Median | 12 |
| 1st and 3rd quartiles | 5; 19 |
| Min. and max. | 1; 27500003 |
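
The maximum above is several orders of magnitude larger than the quartiles. A simple Tukey fence makes the gap explicit, using only the figures printed in the table. This is a generic rule of thumb and not necessarily the outlier rule dataMaid applies; the variable names are hypothetical.

```r
q1 <- 5
q3 <- 19
max_value <- 27500003

# Upper Tukey fence: third quartile plus 1.5 times the interquartile range.
upper_fence <- q3 + 1.5 * (q3 - q1)
upper_fence              # 40
max_value > upper_fence  # TRUE: the maximum lies far outside the fence
```
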

Variable: ContratoCliente

| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "27/08/2024".

Report creation time: Sun Nov 02 2025 14:27:16

The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 3 |
| Number of variables | 2 |
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEstadoEnvioDocumento | integer | 3 | 0.00 % | × |
| Descripcion | character | 3 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “-1” |
| Reference category | -1 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:27:19
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 70 |
| Number of variables | 6 |
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEstadoPorDocumento | integer | 70 | 0.00 % | × |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdEstado | integer | 21 | 0.00 % | × |
| Descripcion.Estado | character | 50 | 0.00 % | × |
| Tipo.Documento | character | 14 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 70 |
| Median | 10072.5 |
| 1st and 3rd quartiles | 10026.25; 100122.5 |
| Min. and max. | -100111; 1006200006 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 21 |
| Median | 2 |
| 1st and 3rd quartiles | 0; 4 |
| Min. and max. | -6; 200006 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 50 |
| Mode | “APROBADO” |
The following values appear with prefixed or suffixed white space: "Por Aprobación ", "Por Preaprobación ".
Note that the following levels have at most five observations: "Abierto", "AJUSTES GENERADOS", "Anulada", "APROBACIÓN DE ACTAS", "Aprobada", …, "RECHAZADO INTERVENTOR", "RECHAZADO TÉCNICO", "SOLICITADA", "SOLICITADO", "TÉCNICO" (40 values omitted).
Note that there might be case problems with the following levels: "Aprobada", "APROBADA", "Aprobado", "APROBADO", "Cerrado", …, "NO PAGO", "Programada", "PROGRAMADA", "Programado", "PROGRAMADO" (4 values omitted).
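The whitespace and case messages above can be reproduced with two small checks. The sketch below is written in Python for illustration and is not dataMaid's implementation; the function names are hypothetical, and the sample list contains only a few of the levels quoted in the messages.

```python
from collections import defaultdict


def edge_whitespace(levels):
    """Return the distinct levels that start or end with whitespace."""
    return sorted(v for v in set(levels) if v != v.strip())


def case_collisions(levels):
    """Return groups of distinct levels that coincide after case folding."""
    groups = defaultdict(set)
    for v in set(levels):
        groups[v.casefold()].add(v)
    return sorted(sorted(g) for g in groups.values() if len(g) > 1)


# A handful of the levels quoted in the messages above.
levels = ["Aprobada", "APROBADA", "Aprobado", "APROBADO",
          "Por Aprobación ", "Por Preaprobación ", "Abierto"]

print(edge_whitespace(levels))
# ['Por Aprobación ', 'Por Preaprobación ']
print(case_collisions(levels))
# [['APROBADA', 'Aprobada'], ['APROBADO', 'Aprobado']]
```

Note that "Aprobada" and "Aprobado" differ by grammatical gender, not by case, so case folding alone does not merge them; whether they denote the same state is a question for the data owner.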
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 14 |
| Mode | “PEDIDOS” |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:27:22
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 22645 |
| Number of variables | 14 |
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdFecha | integer | 22645 | 0.00 % | × |
| Fecha | character | 22645 | 0.00 % | × |
| Año | integer | 62 | 0.00 % | × |
| Mes | integer | 12 | 0.00 % | |
| Dia | integer | 31 | 0.00 % | |
| DiaDelAño | integer | 366 | 0.00 % | |
| SemanaDelAño | integer | 54 | 0.00 % | |
| Trimestre | integer | 4 | 0.00 % | |
| Semestre | integer | 2 | 0.00 % | |
| NombreMes | character | 12 | 0.00 % | |
| NombreMesCorto | character | 12 | 0.00 % | |
| NombreDia | character | 7 | 0.00 % | |
| NombreDiaCorto | character | 7 | 0.00 % | |
| MesAño | character | 732 | 0.00 % | |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 22645 |
| Median | 20200101 |
| 1st and 3rd quartiles | 20040702; 20350702 |
| Min. and max. | 19000101; 20501231 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 62 |
| Median | 2020 |
| 1st and 3rd quartiles | 2004; 2035 |
| Min. and max. | 1900; 2050 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12 |
| Median | 7 |
| 1st and 3rd quartiles | 4; 10 |
| Min. and max. | 1; 12 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 31 |
| Median | 16 |
| 1st and 3rd quartiles | 8; 23 |
| Min. and max. | 1; 31 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 366 |
| Median | 183 |
| 1st and 3rd quartiles | 92; 274 |
| Min. and max. | 1; 366 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 54 |
| Median | 27 |
| 1st and 3rd quartiles | 14; 40 |
| Min. and max. | 1; 54 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “3” |
| Reference category | 1 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “2” |
| Reference category | 1 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12 |
| Mode | “Agosto” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12 |
| Mode | “Ago” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Mode | “Lunes” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Mode | “Lun” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 732 |
| Mode | “Ago-00” |
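The reported range of SkIdFecha (19000101 to 20501231) suggests that the key encodes a date as yyyymmdd. Assuming that encoding, which the report itself does not state, the following Python sketch decodes a key and derives several of the calendar columns using their conventional definitions. The function names are hypothetical, and SemanaDelAño is left out because the week-numbering convention cannot be inferred from the report.

```python
from datetime import date


def decode_date_key(key):
    """Split an integer of the assumed form yyyymmdd into a date."""
    year, rest = divmod(key, 10_000)
    month, day = divmod(rest, 100)
    return date(year, month, day)  # raises ValueError for impossible dates


def calendar_attributes(d):
    """Derive a few calendar-table columns from a date."""
    return {
        "Año": d.year,
        "Mes": d.month,
        "Dia": d.day,
        "DiaDelAño": d.timetuple().tm_yday,
        "Trimestre": (d.month - 1) // 3 + 1,
        "Semestre": 1 if d.month <= 6 else 2,
    }


first = decode_date_key(19000101)  # reported minimum of SkIdFecha
last = decode_date_key(20501231)   # reported maximum of SkIdFecha
print(first, last)
print(calendar_attributes(last))
```

Recomputing these attributes from the key and comparing them with the stored columns is one way to validate a calendar table of this kind.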
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:27:27
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 19251 |
| Number of variables | 24 |
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdInsumo | integer | 19251 | 0.00 % | × |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Codigo.Insumo | integer | 19251 | 0.00 % | |
| Insumo.Descripcion | character | 19251 | 0.00 % | × |
| Agrupacion | numeric | 287 | 0.00 % | × |
| Agrupacion.Descripcion | character | 287 | 0.00 % | × |
| Tipo | character | 6 | 0.00 % | |
| Tipo.Descripcion | character | 6 | 0.00 % | |
| Unidad | character | 30 | 0.00 % | × |
| Descripcion.Unidad | character | 30 | 0.00 % | × |
| Estado | character | 1 | 0.00 % | × |
| Requiere.Equipo | character | 1 | 0.00 % | × |
| Dias.Reposicion | integer | 6 | 0.00 % | × |
| SubAnalisis | character | 1 | 0.00 % | × |
| Devolutivo | character | 2 | 0.00 % | |
| Stock.Maximo | integer | 1 | 0.00 % | × |
| Stock.Minimo | integer | 1 | 0.00 % | × |
| Valor.Unitario | numeric | 9746 | 0.00 % | × |
| Porcentaje.IVA | numeric | 5 | 0.00 % | |
| Valor.Neto | numeric | 10052 | 0.00 % | × |
| Fecha.Creacion | character | 1999 | 0.00 % | × |
| Fecha.Modificacion | character | 1687 | 0.00 % | × |
| Codigo.Insumo.Id | integer | 19251 | 0.00 % | |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 19251 |
| Median | 1009729 |
| 1st and 3rd quartiles | 1004915.5; 10014543.5 |
| Min. and max. | 100101; 10019356 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 19251 |
| Median | 9729 |
| 1st and 3rd quartiles | 4915.5; 14543.5 |
| Min. and max. | 101; 19356 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 287 |
| Median | 1301 |
| 1st and 3rd quartiles | 901; 2504 |
| Min. and max. | 101; 9001 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 287 |
| Mode | “Demás Elementos de Ferretería” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6 |
| Mode | “M” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6 |
| Mode | “Materiales” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 30 |
| Mode | “un” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 30 |
| Mode | “UNIDAD” |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | 0; 60 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “NO” |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 9746 |
| Median | 26010.34 |
| 1st and 3rd quartiles | 60; 233122.55 |
| Min. and max. | 0; 9730090057.86 |
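The table above (9746 unique values, matching Valor.Unitario in the summary) shows a maximum several orders of magnitude above the third quartile. As a rough illustration of why such a value would be flagged, the Python sketch below applies the plain Tukey fence rule to the printed quartiles. This rule is swapped in for illustration and may differ from the rule dataMaid applies; the function name is hypothetical.

```python
def tukey_fences(q1, q3, k=1.5):
    """Lower and upper Tukey fences from the first and third quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles and maximum as printed in the table above.
q1, q3, maximum = 60.0, 233122.55, 9730090057.86

lower, upper = tukey_fences(q1, q3)
print(lower, upper)
print(maximum > upper)  # True: the maximum lies far beyond the upper fence
```

With a distribution this skewed, a symmetric fence flags a large share of the upper tail, so any flagged values should be reviewed rather than removed automatically.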
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 5 |
| Mode | “19” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 10052 |
| Median | 30000 |
| 1st and 3rd quartiles | 71.4; 266457.02 |
| Min. and max. | 0; 9730090057.86 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1999 |
| Mode | “06/12/2010” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1687 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "01/02/2012", "01/03/2013", "01/03/2015", "01/03/2016", "01/04/2015", …, "31/08/2015", "31/08/2023", "31/10/2011", "31/10/2013", "31/10/2014" (1224 values omitted).
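The levels listed above look like dates stored as text, and values such as "31/08/2023" can only be read day-first. The following Python sketch converts such strings to dates and treats the empty string as missing. The day-first format is inferred from the listed values, and the function name is hypothetical.

```python
from datetime import datetime


def parse_day_first(text):
    """Parse 'dd/mm/yyyy'; return None for an empty or blank string."""
    text = text.strip()
    if not text:
        return None
    return datetime.strptime(text, "%d/%m/%Y").date()


# Two of the values listed above plus the empty string flagged as a
# suspected missing value code.
raw = ["31/08/2023", "", "01/02/2012"]
parsed = [parse_day_first(v) for v in raw]
n_missing = sum(d is None for d in parsed)

print(parsed)
print(n_missing)  # 1
```

After this conversion the column would be reported as a Date with explicit missing values, and the "at most five observations" message would no longer apply.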
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 19251 |
| Median | 9729 |
| 1st and 3rd quartiles | 4915.5; 14543.5 |
| Min. and max. | 101; 19356 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:27:35
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 49311 |
| Number of variables | 23 |
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdItems | integer | 49311 | 0.00 % | × |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdAPU | integer | 37856 | 0.00 % | |
| SkIdNivel | integer | 1 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Item.No | character | 21497 | 0.00 % | × |
| SubCapitulo | character | 512 | 0.00 % | × |
| Item.Descripcion | character | 23786 | 0.00 % | × |
| Cantidad | numeric | 11916 | 0.00 % | × |
| Valor.Sin.IVA | numeric | 17985 | 0.00 % | × |
| Precio.Venta | numeric | 476 | 0.00 % | × |
| Codigo.Cliente | character | 367 | 0.00 % | × |
| Cantidad.Proyectada | numeric | 9796 | 0.00 % | × |
| Unidad.Medida | character | 62 | 0.00 % | × |
| Item.estado | character | 3 | 0.00 % | |
| Metro.cuadrado | numeric | 3 | 0.00 % | |
| Aplica.En.Contratos | character | 2 | 0.00 % | |
| Aplica.En.Almacen | character | 2 | 0.00 % | |
| Bloqueo.De.Contratos.Por.Cantidad | character | 2 | 0.00 % | |
| Bloqueo.De.Contratos.Por.Valor | character | 2 | 0.00 % | |
| Bloqueo.De.Salidas.Por.Cantidad | character | 2 | 0.00 % | |
| Bloqueo.De.Salidas.Por.Valor | character | 2 | 0.00 % | |
| Clase.Item | logical | 1 | 100.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 49311 |
| Median | 10077795 |
| 1st and 3rd quartiles | 10038965.5; 100106680.5 |
| Min. and max. | 1002462; 100145937 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 37856 |
| Median | 2110861 |
| 1st and 3rd quartiles | 1173627; 226011666.5 |
| Min. and max. | 30004; 295021079 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 21497 |
| Mode | “7.001” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: "01 54 00 80 ", "01 56 26 1 ", "01 91 13 1 ", "09 30 13 51 ", "23.704 ", "25.05.801 ", "26 43 13 1 ".
Note that the following levels have at most five observations: "|25.04.003", "01 31 13 1.1", "01 31 13 1.2", "01 31 13 10.1", "01 31 13 10.2", …, "9.96", "9.97", "9.98", "9.990", "9.991" (19506 values omitted).
Note that there might be case problems with the following levels: "2.01.106.2a", "2.01.106.2A", "2.01.106.2b", "2.01.106.2B", "2.01.106.3a", …, "2.01.106.3B", "2.01.106.4a", "2.01.106.4A", "2.01.106.4b", "2.01.106.4B" (2 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 512 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "", "-".
The following values appear with prefixed or suffixed white space: " Generales", " Torre 2", " Unidad estructural 8", "Plataforma - E1 ", "Reclamaciones ".
Note that the following levels have at most five observations: " Torre 2", "|", "100x100x5 Protección pasiva contra fuego", "200x200x5 Protección pasiva contra fuego", "300x300x1/2 Protección pasiva contra fuego", …, "Washer", "Zapata ARC", "Zapata ARE", "Zarpa muro de contención ARC", "Zonas Comunales" (229 values omitted).
Note that there might be case problems with the following levels: "claustro", "Claustro", "Comercio", "COMERCIO", "comunes", …, "Urbanismo Interno", "Vivienda", "VIVIENDA", "Zonas comunales", "Zonas Comunales" (33 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 23786 |
| Mode | “Concreto pobre” |
The following suspected missing value codes enter as regular values: ".", "..".
The following values appear with prefixed or suffixed white space: " Administración de obra (Costo reembolsable)", " Ajuste TRM Puertas cortafuego Almacen el Arq", " Banca coworking C-20", " Banca coworking C-21", " CIELO RASO EN LAMINA DE PVC", …, "Vigas de amarres concreto 4000 psi ", "Vigas y viguetas segunda etapa ", "Vinilo sobre pañete ", "Win plástico ", "Zonas duras " (1089 values omitted).
Note that the following levels have at most five observations: " Administración de obra (Costo reembolsable)", " Banca coworking C-20", " Banca coworking C-21", " CIELO RASO EN LAMINA DE PVC", " CIELO RASO EN PANELES DE YESO 1/2”+", …, "Zona verde piso 1", "Zonas comunales - BBQ", "Zonas duras ", "Zonas Verdes", "Zorra metálica canecas" (22274 values omitted).
Note that there might be case problems with the following levels: "Acero de 60000 psi cimentación", "Acero de 60000 PSI cimentación", "Acero de 60000 psi pilotes", "Acero de 60000 PSI pilotes", "Acero de refuerzo escaleras", …, "Vigas de Cimentación en Concreto", "Win plástico", "Win Plástico", "Zapatas en concreto 3000 psi", "Zapatas en concreto 3000 PSI" (272 values omitted).
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11916 |
| Median | 16.7 |
| 1st and 3rd quartiles | 1; 157.76 |
| Min. and max. | 0; 7225562 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 17985 |
| Median | 464231.12 |
| 1st and 3rd quartiles | 0; 10717500.12 |
| Min. and max. | -176105480; 1.8e+11 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 476 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | 0; 1.4e+09 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 367 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "1.01.01", "1.01.02", "1.01.03", "1.01.04", "1.01.05", …, "OC 95", "OC 96", "OC 97", "OC 98", "OC 99" (356 values omitted).
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 9796 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | -224187.91; 1435780.16 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 62 |
| Mode | “un” |
The following values appear with prefixed or suffixed white space: "m2 ", "un ", "Un ".
Note that the following levels have at most five observations: "%", "0", "dia", "gbl", "glb", …, "ton", "ün", "un ", "Visi", "VJ" (17 values omitted).
Note that there might be case problems with the following levels: "di", "DI", "gb", "GB", "gl", …, "Un ", "und", "Und", "vj", "VJ" (29 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “EJECUCIÓN” |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “0” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “SI” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “SI” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “SI” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “SI” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “SI” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “SI” |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:27:49
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 0 |
| Number of variables | 8 |
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdNivel | logical | 0 | NaN % | × |
| SkIdEmpresa | logical | 0 | NaN % | × |
| Codigo.Proyecto | logical | 0 | NaN % | × |
| Id.Nivel | logical | 0 | NaN % | × |
| Descripcion.Nivel | logical | 0 | NaN % | × |
| Nivel.Auxiliar | logical | 0 | NaN % | × |
| Orden | logical | 0 | NaN % | × |
| Empresa | logical | 0 | NaN % | × |
Each of the eight variables listed above is reported both as a key (distinct values for each observation) and as taking only one value: "NA". Because the dataset has 0 observations, both messages hold trivially and say nothing about the underlying data.
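The "NaN %" entries in the summary table above are what a missing-value percentage produces when the number of observations is zero: in R, 0 / 0 evaluates to NaN. The Python sketch below mirrors that behaviour explicitly, since Python raises an error on division by zero. The function name is hypothetical, and the exact expression dataMaid uses is an assumption. The second call uses the 47 missing values out of 89 observations reported for MacroProyecto elsewhere in this document.

```python
import math


def missing_percentage(n_missing, n_obs):
    """Percentage of missing observations; NaN when there are no rows."""
    if n_obs == 0:
        return math.nan  # mirrors 0 / 0 in R, which evaluates to NaN
    return 100.0 * n_missing / n_obs


empty_case = missing_percentage(0, 0)
macro_case = missing_percentage(47, 89)

print(empty_case)            # nan
print(round(macro_case, 2))  # 52.81
```

Checking the row count before generating a report would avoid producing an uninformative report for an empty table.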
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:29:14
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 7 |
| Number of variables | 2 |
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdOrigenDelDocumento | integer | 7 | 0.00 % | |
| Descripcion | character | 7 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Median | 3 |
| 1st and 3rd quartiles | 1.5; 4.5 |
| Min. and max. | 0; 6 |
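The quartiles 1.5 and 4.5 in the table above are consistent with linear interpolation between order statistics, which is the default of R's `quantile()` (type 7). The Python sketch below reproduces them. It relies on one deduction from the report: an integer column with 7 observations, 7 unique values, minimum 0 and maximum 6 can only contain the integers 0 through 6. That dataMaid calls `quantile()` with its default type is an assumption.

```python
import statistics

# With 7 observations, 7 distinct integer values, minimum 0 and maximum 6,
# the column can only contain the integers 0 through 6.
values = list(range(7))

# method="inclusive" interpolates linearly between order statistics,
# which corresponds to type 7 in R's quantile().
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(q1, median, q3)  # 1.5 3.0 4.5
```

The default `method="exclusive"` would give different quartiles for the same data, so the interpolation method matters when comparing figures across tools.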
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:29:16
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 89 |
| Number of variables | 37 |
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdProyecto | integer | 89 | 0.00 % | × |
| Codigo.Proyecto | integer | 89 | 0.00 % | |
| Nombre.Proyecto | character | 89 | 0.00 % | × |
| Clase.Proyecto | character | 4 | 0.00 % | × |
| Tipo | character | 2 | 0.00 % | × |
| Estado | character | 3 | 0.00 % | |
| Presupuesto.Fijo | character | 2 | 0.00 % | |
| Propietario | character | 4 | 0.00 % | × |
| Sucursal | integer | 89 | 0.00 % | |
| Sucursal.Nombre | character | 89 | 0.00 % | × |
| MacroProyecto | integer | 11 | 52.81 % | × |
| MacroProyecto.Descripcion | character | 11 | 0.00 % | × |
| Centro.Costo | integer | 74 | 0.00 % | × |
| Centro.Costo.Descripcion | character | 73 | 0.00 % | × |
| VIS | character | 2 | 0.00 % | |
| Sucursal.Administrativa | character | 1 | 0.00 % | × |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Fecha.De.Elaboracion | character | 59 | 0.00 % | × |
| Fecha.De.Inicio | character | 61 | 0.00 % | × |
| Fecha.De.Finalizacion | character | 61 | 0.00 % | × |
| UnidadAConstruir | character | 4 | 0.00 % | × |
| CantidadAConstruir | numeric | 12 | 0.00 % | × |
| AreaAConstruir_M2 | numeric | 23 | 0.00 % | × |
| AreaConstruidaFinal_M2 | numeric | 13 | 1.12 % | × |
| AreaAVender_M2 | numeric | 4 | 0.00 % | × |
| Arealote_M2 | numeric | 5 | 1.12 % | × |
| CostoPreFactibilidad | numeric | 2 | 2.25 % | × |
| Iniciales | logical | 1 | 100.00 % | × |
| Nocontrato | logical | 1 | 100.00 % | × |
| Cliente | character | 12 | 0.00 % | × |
| Inversionista | character | 15 | 0.00 % | × |
| Almacenista | character | 14 | 0.00 % | × |
| PorcentajeAdministracion | numeric | 3 | 0.00 % | × |
| PorcentajeImprevistos | numeric | 3 | 0.00 % | × |
| PorcentajeUtilidad | numeric | 3 | 0.00 % | × |
| IVA | numeric | 3 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 89 |
| Median | 100204 |
| 1st and 3rd quartiles | 100128; 100255 |
| Min. and max. | 1003; 100295 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 89 |
| Median | 204 |
| 1st and 3rd quartiles | 128; 255 |
| Min. and max. | 3; 295 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “Admon Delegada sin Representacion” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “ADPRO” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “Finalizado” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “NO” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “SIN PROPIETARIO” |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 89 |
| Median | 204 |
| 1st and 3rd quartiles | 128; 255 |
| Min. and max. | 3; 295 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 47 (52.81 %) |
| Number of unique values | 10 |
| Median | 14 |
| 1st and 3rd quartiles | 9.25; 103 |
| Min. and max. | 1; 107 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "Caminos de Sie - Manzana 2", "Caminos de Sie - Manzana 4", "Centro Cultural Atrio", "Hotel Four Seasons San Francisco", "Quadro Smart Living", "Valverde Ciprés", "Valverde Roble".
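The empty string is the most frequent value of this variable, yet it is counted as a regular level rather than as missing. A minimal base-R sketch of one way to recode it follows; the helper name `blank_to_na` and the toy vector are hypothetical and not part of the original analysis.

```r
# Hypothetical helper: recode empty or whitespace-only strings as NA.
blank_to_na <- function(x) {
  stopifnot(is.character(x))
  x[!is.na(x) & trimws(x) == ""] <- NA_character_
  x
}

# Toy values standing in for a description column (illustrative only).
desc <- c("", "Valverde Roble", "", "Quadro Smart Living", " ")
clean <- blank_to_na(desc)

# Number of values now treated as missing.
sum(is.na(clean))
```

After recoding, the missing-value share of the description column can be compared with that of its companion code column.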
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 74 |
| Median | 2308307 |
| 1st and 3rd quartiles | 2304409; 2312414 |
| Min. and max. | 2200101; 10388801 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 73 |
| Mode | “Edificaciones Manz 1” |
The following values appear with prefixed or suffixed white space: "Cipres Directos ", "Cipres Locales NO Vis ", "Costos compartidos Dts Arboleda ", "Directos San Francisco ", "Dosel del Bosque Piscilago ", …, "Plataforma Arboleda Vis ", "Roble Directos ", "Roble Locales NO Vis ", "Urbanismo Externo Arboleda Vis ", "Vive 92 NQS Directos " (4 values omitted).
Note that the following levels have at most five observations: "Acabados Fase I CC UAandes", "Centro Civico Univ.Andes", "Cipres Directos ", "Cipres Locales NO Vis ", "Costos compartidos Dts Arboleda ", …, "Urban Interno Manz 4", "Urban P.Principal", "Urbanismo Externo Arboleda Vis ", "Vive 92 NQS Directos ", "VV Nogal Directos" (61 values omitted).
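Trailing spaces can split one label into several levels, which may inflate the count of rare levels. The sketch below uses base R's `trimws()` on toy values; the untrimmed and trimmed pair is an assumption made for illustration, since the report does not say whether trimmed duplicates exist.

```r
# Toy values modelled on the flagged levels; the duplicate pair is illustrative.
cc <- c("Cipres Directos ", "Cipres Directos", "Roble Directos ", "Edificaciones Manz 1")

# trimws() removes leading and trailing whitespace (both sides by default).
cc_trim <- trimws(cc)

length(unique(cc))       # distinct labels before trimming
length(unique(cc_trim))  # distinct labels after trimming
```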
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “NO” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 59 |
| Mode | “03/02/2015” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 61 |
| Mode | “03/10/2022” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 61 |
| Mode | “03/02/2017” |
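The three date variables are stored as character, so no date-specific checks were applied to them. A hedged sketch of parsing them with base R's `as.Date()` follows. The day/month/year order is an assumption: the modes shown above are ambiguous, and only a value whose first field exceeds 12 would confirm it.

```r
# Toy values in the same shape as the modes reported above.
fechas <- c("03/02/2015", "03/10/2022", "03/02/2017")

# ASSUMPTION: day/month/year order; the report alone cannot confirm this.
parsed <- as.Date(fechas, format = "%d/%m/%Y")

# Strings that do not match the format become NA, so count them first.
sum(is.na(parsed))
format(parsed, "%Y-%m-%d")
```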
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “m2” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "M2", "un".
Note that there might be case problems with the following levels: "m2", "M2".
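The levels "m2" and "M2" most likely denote the same unit. One possible normalization is to fold case before tabulating, sketched below on a toy vector (the counts are illustrative, not taken from the data).

```r
# Toy vector with the four levels reported above: "m2", "M2", "un" and "".
unidad <- c("m2", "M2", "m2", "un", "")

# Fold to lower case (and trim) so that "m2" and "M2" become one level.
unidad_norm <- tolower(trimws(unidad))

table(unidad_norm)
```

Whether lower case is the preferred canonical form is a design choice; `toupper()` would work equally well.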
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12 |
| Median | 1 |
| 1st and 3rd quartiles | 0; 1 |
| Min. and max. | 0; 540 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 23 |
| Median | 1 |
| 1st and 3rd quartiles | 0; 1 |
| Min. and max. | 0; 40996 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 1 (1.12 %) |
| Number of unique values | 12 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 1 |
| Min. and max. | 0; 21904.02 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “0” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 1 (1.12 %) |
| Number of unique values | 4 |
| Mode | “0” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "Arpro", "ARPRO ARQUITECTOS INGENIERSO S.A.", "ARPRO INGENIEROS ARQUITECTOS S.A.", "Caja de Compensación Colsubsidio", "CANPACK COLOMBIA SAS", "INVERSIONES FAMOSO", "Pontificia Universidad Javieriana", "QBO", "Solution Investment S.A.S.", "Universidad de los Andes".
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 15 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: "ARPRO ", "ARPRO Arquitectos Ingenieros S.A. ".
Note that the following levels have at most five observations: "Arpro", "ARPRO ", "Arpro Arquitecto Ingenieros S.A.", "ARPRO Arquitectos Ingenieros S.A. ", "Chaid Neme", …, "INVERSIONES ARPRO PROVI SAS", "Prominsa", "PROMINSA LTDA", "SOMEC", "Uniandes" (2 values omitted).
Note that there might be case problems with the following levels: "Arpro Arquitectos Ingenieros S.A.", "ARPRO Arquitectos Ingenieros S.A.".
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 14 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "Andrea Pedraza", "Carlos Medina", "Fredy Ortiz", "Isidro González", "Juan Carlos Chaparro", …, "Orlando Camacho", "Oscar Fernando Pulido", "Oscar Marca", "Rafael Rodriguez", "Rafael Rodríguez" (1 value omitted).
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “0” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “0” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “0” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “16” |
| Reference category | 0 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:29:18
Report was run from directory:
D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 12709 |
| Number of variables | 19 |
The following variable checks were performed, depending on the data type of each variable:
| | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdTercero | integer | 12709 | 0.00 % | |
| Nombre | character | 12673 | 0.00 % | × |
| Nit | character | 12709 | 0.00 % | × |
| Contacto | character | 3471 | 1.10 % | × |
| | character | 4235 | 0.00 % | × |
| Direccion | character | 11858 | 0.00 % | × |
| Telefono | character | 10836 | 0.00 % | × |
| Tipo | character | 4 | 0.00 % | × |
| Plazo.de.pago | numeric | 11 | 0.01 % | × |
| Ciudad | character | 146 | 0.00 % | × |
| CIIU.Cod | integer | 497 | 37.07 % | × |
| CIIU | character | 496 | 0.00 % | × |
| Estado | character | 3 | 0.00 % | × |
| Especialidad | logical | 1 | 100.00 % | × |
| Categoria | logical | 1 | 100.00 % | × |
| Grupo | logical | 1 | 100.00 % | × |
| Calificacion | logical | 1 | 100.00 % | × |
| Cargo | character | 851 | 0.00 % | × |
| Naturaleza | character | 3 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12709 |
| Median | 6354 |
| 1st and 3rd quartiles | 3177; 9531 |
| Min. and max. | -1; 12708 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12673 |
| Mode | “360 GRADOS AGENCIA CREATIVA SAS” |
The following values appear with prefixed or suffixed white space: " AIRALO INC", " CANTATA SA", " CVS PHARMACY", " TRADER JOES", " WHOLE FOODS MARKET INC", …, "ZABALETA BLADIMIR ", "ZERDA ESGUERRA DIEGO LIBARDO ", "ZORRO GUTIERREZ MODESTO ", "ZULUAGA BOTERO JORGE MAURICIO ", "ZULUAGA DUQUE RAMON EUSEBIO " (1310 values omitted).
Note that the following levels have at most five observations: " AIRALO INC", " CANTATA SA", " CVS PHARMACY", " TRADER JOES", " WHOLE FOODS MARKET INC", …, "ZUÑIGA SANCHEZ ALEJANDRA MARIA", "ZURICH COLOMBIA SEGUROS S.A.", "ZURICH COLOMBIA SEGUROS SA", "ZURITA GUTIERREZ ALFONSO RAFAEL", "ZYCOL LTDA" (12663 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 140 (1.1 %) |
| Number of unique values | 3470 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: " CAROLINA GUALTERO URREGO", " CASTRO ANA MARIA", " FRANCIA ACOSTA MARTÍNEZ", " GINA PAOLA CUBILLOS PULIDO", "4311-4312-4923 ", …, "YEINS SMITH ", "YENNIFER ZAPATA ALZATE ", "YHEEFRY ENRIQUEZ SUAREZ ", "YOLANDA SANTAMARÍA ARDILA ", "ZARATE JORGE " (574 values omitted).
Note that the following levels have at most five observations: " CAROLINA GUALTERO URREGO", " CASTRO ANA MARIA", " FRANCIA ACOSTA MARTÍNEZ", " GINA PAOLA CUBILLOS PULIDO", ",ARIA CLEMENCIA GOMEZ PRIETO", …, "ZARATE PARRA ORLANDO", "ZENELIA GIRALDO", "ZULEIMA MARTINEZ", "ZULEYMY MENDEZ", "ZULUAGA REINA JULIANA" (3453 values omitted).
Note that there might be case problems with the following levels: "claudia carolina castrillo galvis", "CLAUDIA CAROLINA CASTRILLO GALVIS", "Juan Camilo Martin Herrera", "JUAN CAMILO MARTIN HERRERA".
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4235 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: " analista.financiera@servimeters.com", " certificados@ifxcorp.com", " dotacionesnw@gmail.com", " UNIT 1-834 SQUARE ON SHOPPING CENTRE MISSISAUGA", "admin@mpmbtl.com ", …, "tbermeo@guidepostsolutions.com ", "tesoreria@liftlogicco.com ", "tesorerria@potenco.com.co ", "transestrellasas@outlook.com ", "ventas@atinstrumentos.com " (60 values omitted).
Note that the following levels have at most five observations: " analista.financiera@servimeters.com", " certificados@ifxcorp.com", " dotacionesnw@gmail.com", " UNIT 1-834 SQUARE ON SHOPPING CENTRE MISSISAUGA", "1004 MIDDLEGATE, ON L4Y1M4, CANADA", …, "zebasgg97@gmail.com", "zetarsas@hotmail.com", "zinetikom@gmail.com", "zmartinez@canwindows.com", "zulyth2008@hotmail.com" (4210 values omitted).
Note that there might be case problems with the following levels: "caja menor maquinaria", "CAJA MENOR MAQUINARIA", "lc_construccion@hotmail.com", "LC_CONSTRUCCION@hotmail.com".
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11858 |
| Mode | “0” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: " AD ", " AER PALMIRA", " CL DEL CHORRITO ", " CONDOMINIO ALEJANDRIA AP L-302", " CONJUNTO TORRES DEL", …, "VRDA BOJACA ", "VRE 1 1 1 ", "VTE TOCAIMA EN TERMINAL ", "WORD TRADE CENTER ", "ZARAGOCILLA CALLE EL PORVENIR N. 49-80 " (2231 values omitted).
Note that the following levels have at most five observations: "", " AD ", " AER PALMIRA", " CL DEL CHORRITO ", " CONDOMINIO ALEJANDRIA AP L-302", …, "ZN 249 USA", "ZN 767 SUIZA", "ZN 767 Zurich", "ZN TX 77056 1360", "ZN ZF PERMANENTE DEL CAUCA ET1 LOTE 4" (11823 values omitted).
Note that there might be case problems with the following levels: "calle 98 8 28 of602", "CALLE 98 8 28 OF602", "CL 84 28b 95", "CL 84 28B 95", "CL 98 8 28 of 602", …, "CONDOMINIO CHORLAVI CS 1 SECTOR EL TIGRE", "CR 13a 90 21 OF 203", "CR 13A 90 21 OF 203", "cra 12-79-50", "CRA 12-79-50" (4 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 10836 |
| Mode | “6010404” |
The following suspected missing value codes enter as regular values: "", " ", ".".
The following values appear with prefixed or suffixed white space: " ", " 6718605", " 111", " 2465662 - 2171887", " 5311155 ", …, "9071246 ", "9098650 ", "9236046 ", "9260995 ", "CEL 311 - 5730404 " (403 values omitted).
Note that the following levels have at most five observations: " ", " 6718605", " 111", " 2465662 - 2171887", " 5311155 ", …, "981179299", "981396252", "981814873", "CEL 311 - 5730404 ", "NO HAY" (10816 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “P” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "".
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 1 (0.01 %) |
| Number of unique values | 10 |
| Median | 1 |
| 1st and 3rd quartiles | 1; 30 |
| Min. and max. | 0; 90 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 146 |
| Mode | “BOGOTÁ D.C.” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "", "AGUA DE DIOS", "AGUACHICA", "AIPE", "ANAPOIMA", …, "YOPAL", "ZETAQUIRA", "ZIPACON", "ZOETERMEER", "ZONA BANANERA" (93 values omitted).
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 4711 (37.07 %) |
| Number of unique values | 496 |
| Median | 7730 |
| 1st and 3rd quartiles | 4663; 9900155 |
| Min. and max. | 10; 990074221 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 496 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "Acabado de productos textiles.", "Actividades combinadas de apoyo a instalaciones.", "Actividades de administración de fondos.", "Actividades de aeropuertos, servicios de navegación aérea y demás actividades conexas al transporte aéreo.", "Actividades de agentes y corredores de seguros", …, "Transporte urbano colectivo regular de pasajeros", "Tratamiento y disposición de desechos no peligrosos.", "Tratamiento y disposición de desechos peligrosos.", "Tratamiento y revestimiento de metales; mecanizado.", "Tratamiento y revestimiento de metales; trabajos de ingeniería mecánica en general realizados a cambio de una retribución o por contrata" (303 values omitted).
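The code column reports 37.07 % missing values, while the description column reports none and has the empty string as its mode. The report does not state whether the two coincide. A minimal sketch of a cross-check follows; the toy data frame `ter` and its description values are hypothetical.

```r
# Toy data frame standing in for the code/description pair (illustrative values).
ter <- data.frame(
  CIIU.Cod = c(4663L, NA, 7730L, NA),
  CIIU = c("Actividad A", "", "Actividad B", ""),
  stringsAsFactors = FALSE
)

# Cross-tabulate missing codes against blank descriptions.
tab <- table(code_missing = is.na(ter$CIIU.Cod), desc_blank = ter$CIIU == "")
tab
```

Off-diagonal counts above zero would point to rows where only one of the two fields is filled.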
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “Activo” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "".
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 851 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: " CARTERA", "ABOGADA ", "ADMINISTRACION ", "ADMINISTRADOR ", "ADMINISTRADOR DE PROYECTOS ", …, "TESORERA ", "TESORERIA ", "TESORERO ", "VENDEDOR ", "VENTAS CONSTRUCTOR " (205 values omitted).
Note that the following levels have at most five observations: " CARTERA", "3144578699", "6720287", "ABOGADA ", "ABOGADO", …, "VENDEDOR / GERENTE", "VENDEDORA", "VENTAS", "VENTAS CONSTRUCTOR ", "Vicepresidente de Carteras colectivas" (762 values omitted).
Note that there might be case problems with the following levels: "Administrador", "ADMINISTRADOR", "Administradora ", "ADMINISTRADORA ", "analista contabilidad", …, "REPRESENTANTE LEGAL", "Tesoreria", "TESORERIA", "Vendedor", "VENDEDOR" (56 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “J” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "".
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:29:31
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 64 |
| Number of variables | 5 |
| | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdTipoContrato | integer | 64 | 0.00 % | |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Tipo.Codigo | character | 64 | 0.00 % | × |
| Tipo.Descripcion | character | 64 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 64 |
| Median | 32.5 |
| 1st and 3rd quartiles | 16.75; 48.25 |
| Min. and max. | 1; 64 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:30:20
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 3 |
| Number of variables | 2 |
| | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdTipoDescuento | integer | 3 | 0.00 % | × |
| Descripcion | character | 3 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “1” |
| Reference category | 1 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:30:21
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 11 |
| Number of variables | 3 |
| | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdTipoPoliza | integer | 11 | 0.00 % | |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Descripcion | character | 11 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11 |
| Median | 1006 |
| 1st and 3rd quartiles | 1003.5; 10036.5 |
| Min. and max. | 1001; 10039 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:30:23
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 421 |
| Number of variables | 7 |
| | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdUsuario | integer | 421 | 0.00 % | × |
| SkIdEmpresa | integer | 2 | 0.00 % | × |
| Nombre | character | 421 | 0.00 % | × |
| Cargo | character | 177 | 0.00 % | × |
| Nivel.Acceso | character | 54 | 0.00 % | × |
| Estado | character | 3 | 0.00 % | × |
| Empresa | character | 2 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 421 |
| Median | 100304 |
| 1st and 3rd quartiles | 100199; 100409 |
| Min. and max. | 0; 100514 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “100” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 177 |
| Mode | “Residente de Obra” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: "Administrador de Obra ", "Auxiliar de compras No1 ", "Director Obra ", "Gerente ", "Gerente de Proyectos ", "Residente de Obra ".
Note that the following levels have at most five observations: "Admin1", "Administrador", "Administrador de Obra ", "Administrador de proyecto", "Administrador Obra Engativa", …, "Residente Provisional de Obra", "Revisor Fiscal", "Seguridad Informatica", "SIN CARGO", "Supervisor" (155 values omitted).
Note that there might be case problems with the following levels: "Asistente contable", "Asistente Contable", "Auxiliar Control de costos", "Auxiliar Control de Costos", "Director de compras", …, "Gerente de Proyecto", "interventor", "Interventor", "n/a", "N/A" (2 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 54 |
| Mode | “Interventor” |
The following values appear with prefixed or suffixed white space: "Sandra Mireya ".
Note that the following levels have at most five observations: "Acceso Total", "Administrador", "Analista", "Analista Financiero", "Andrea Rada", …, "Revisoría Fiscal", "Sandra Leon", "Seguridad Informatica NH", "SIN NIVEL", "Soporte Arpro" (26 values omitted).
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “ACTIVO” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “ARPRO ARQUITECTOS INGENIEROS S.A.S” |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:30:26
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 17 |
| Number of variables | 5 |
| | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdVariablesAdicionalesContratos | integer | 10 | 0.00 % | |
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Variable.configurada | character | 2 | 0.00 % | |
| Respuesta.de.variable | character | 4 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 10 |
| Median | 1002010040 |
| 1st and 3rd quartiles | 1001860330; 1002360046 |
| Min. and max. | 1001190044; 1002750087 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “Fecha Ultima Propuesta” |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "Construcción de edificios residenciales (4111), Construcción de edificios no residenciales (4112).", "Diciembre de 2018", "Enero 6 de 2020".
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:30:29
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 0 |
| Number of variables | 5 |
| | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdVariablesAdicionalesContratos | logical | 0 | NaN % | × |
| SkIdEmpresa | logical | 0 | NaN % | × |
| Variable.configurada | logical | 0 | NaN % | × |
| Respuesta.de.variable | logical | 0 | NaN % | × |
| Empresa | logical | 0 | NaN % | × |
SkIdVariablesAdicionalesContratos: The variable is a key (distinct values for each observation). The variable only takes one value: "NA".
SkIdEmpresa: The variable is a key (distinct values for each observation). The variable only takes one value: "NA".
Variable.configurada: The variable is a key (distinct values for each observation). The variable only takes one value: "NA".
Respuesta.de.variable: The variable is a key (distinct values for each observation). The variable only takes one value: "NA".
Empresa: The variable is a key (distinct values for each observation). The variable only takes one value: "NA".
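The `NaN %` entries in the summary follow from the dataset having 0 observations: a missing-value percentage is a ratio of counts, and here both counts are zero. The sketch below illustrates this in base R, assuming a hypothetical empty data frame `df_empty`; the helper `has_rows()` is likewise hypothetical and shows one way such tables could be skipped before a report is requested.

```r
# Hypothetical empty data frame: columns exist, but there are no rows.
# Empty columns of unknown type are typically read in as logical.
df_empty <- data.frame(SkIdEmpresa = logical(0), Empresa = logical(0))

# Percentage missing = 100 * missing / total, which is 0 / 0 here.
pct_missing <- 100 * sum(is.na(df_empty$Empresa)) / nrow(df_empty)
is.nan(pct_missing)

# Hypothetical guard: only build a report for data frames that contain rows.
has_rows <- function(df) nrow(df) > 0
```

A caller could wrap the report call in `if (has_rows(df)) { ... }` so that empty tables are logged rather than summarised.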
Report generation information:
Report creation time: Sun Nov 02 2025 14:30:32
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 1 |
| Number of variables | 3 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdZona | integer | 1 | 0.00 % | × |
| Zona | character | 1 | 0.00 % | × |
| ProyectoDescripcion | character | 1 | 0.00 % | × |
The variable is a key (distinct values for each observation).
The variable only takes one (non-missing) value: "Zona Valverdes". The variable contains 0 % missing observations.
The variable is a key (distinct values for each observation).
The variable only takes one (non-missing) value: "217 Valverde - Palma". The variable contains 0 % missing observations.
Report generation information:
Report creation time: Sun Nov 02 2025 14:30:34
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 199600 |
| Number of variables | 27 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 80 | 0.00 % | × |
| SkIdFecha | integer | 4251 | 0.00 % | × |
| SkIdEstado | integer | 7 | 0.00 % | × |
| SkIdInsumo | integer | 5437 | 0.00 % | × |
| SkIdItems | integer | 22778 | 0.00 % | × |
| SkIdEspecificacionActas | numeric | 55896 | 0.00 % | × |
| SkIdTercero | integer | 2521 | 0.00 % | × |
| Porcentaje.Anticipo | numeric | 33 | 0.00 % | × |
| Valor.Anticipo | numeric | 2773 | 0.00 % | × |
| Porcentaje.Retencion.Antcipo | numeric | 56 | 0.00 % | |
| Valor.Retencion.Anticipo | numeric | 10166 | 0.00 % | × |
| Porcentaje.Retencion.Garantia | numeric | 12 | 1.47 % | |
| Valor.Retencion.Garantias | numeric | 16682 | 0.00 % | × |
| Valor.Descuentos | numeric | 1962 | 0.00 % | × |
| Valor.Total.Neto | numeric | 34442 | 0.00 % | × |
| Valor.Iva.Total | numeric | 23380 | 0.00 % | × |
| Valor.Total.Acta | numeric | 40464 | 0.00 % | × |
| Cantidad.Acta | numeric | 40010 | 2.90 % | × |
| Valor.Unitario | numeric | 50765 | 2.90 % | × |
| Valor.Iva.Unitario | numeric | 30544 | 2.90 % | × |
| Valor.Total | numeric | 121678 | 2.90 % | × |
| No.Contrato | integer | 11407 | 0.00 % | × |
| Tipo.Acta | character | 4 | 0.00 % | |
| No.Acta | integer | 385 | 0.00 % | × |
| Porcentaje.Retencion.Garantia.Fic | numeric | 3 | 0.17 % | |
| Valor.Garantias.Fic | numeric | 6 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 80 |
| Median | 100200 |
| 1st and 3rd quartiles | 100108; 100228 |
| Min. and max. | 1003; 100294 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4251 |
| Median | 20190516 |
| 1st and 3rd quartiles | 20160601; 20230329 |
| Min. and max. | 20010302; 20251031 |
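The integer variable summarised above (4251 unique values, matching SkIdFecha in the summary table) appears to encode dates as YYYYMMDD, so its median and quartiles are not meaningful as numbers. A minimal sketch, assuming that encoding, converts the keys to `Date` before summarising; the three values are the minimum, median and maximum reported above.

```r
# Minimum, median and maximum of the date key, taken from the table above.
sk_id_fecha <- c(20010302L, 20190516L, 20251031L)

# Parse the YYYYMMDD integers as dates (assumes that encoding holds).
fecha <- as.Date(as.character(sk_id_fecha), format = "%Y%m%d")

# Keys that are not valid calendar dates become NA and can be counted.
n_invalid <- sum(is.na(fecha))
```

Once converted, `range(fecha)` and `diff(range(fecha))` describe the covered period directly.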
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Median | 1006200001 |
| 1st and 3rd quartiles | 1006200001; 1006200001 |
| Min. and max. | 10068; 1006200005 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 5437 |
| Median | 1007763 |
| 1st and 3rd quartiles | 1002614; 1008952 |
| Min. and max. | 1000; 10019219 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 22778 |
| Median | 10066896 |
| 1st and 3rd quartiles | 10027329; 10079491 |
| Min. and max. | 1000; 100144964 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 55896 |
| Median | 10021121100028 |
| 1st and 3rd quartiles | 10011111100055; 100118118004729 |
| Min. and max. | 1003300011; 1002492490076239 |
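This key is stored as numeric (double) rather than integer, presumably because its values exceed the 32-bit integer range. A minimal sketch, using the three values reported above, checks that such identifiers are still represented exactly (doubles hold integers exactly up to 2^53) and prints them without scientific notation.

```r
# Minimum, median and maximum of the key, taken from the table above.
ids <- c(1003300011, 10021121100028, 1002492490076239)

# Doubles represent every integer exactly up to 2^53.
all_exact <- all(abs(ids) <= 2^53 & ids == round(ids))

# Which values do not fit in R's 32-bit integer type?
exceeds_int <- ids > .Machine$integer.max

# Fixed-notation labels, useful when the key is used for joins or display.
id_labels <- sprintf("%.0f", ids)
```

If identifiers may ever exceed 2^53, storing them as character is the safer choice.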
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2521 |
| Median | -1 |
| 1st and 3rd quartiles | -1; -1 |
| Min. and max. | -1; 12702 |
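The median and both quartiles of this key are -1, which suggests that -1 may be a placeholder for "no third party" rather than a real identifier. That reading is an assumption and is not confirmed by the report. Under that assumption, a minimal sketch with made-up sample values measures the share of placeholder rows and recodes them as missing:

```r
# Made-up sample; only the values -1 and 12702 come from the table above.
sk_id_tercero <- c(-1L, -1L, -1L, 12702L, -1L)

# Share of rows carrying the assumed placeholder value.
share_sentinel <- mean(sk_id_tercero == -1L)

# Recode the placeholder as NA so that it shows up as missing.
tercero_clean <- replace(sk_id_tercero, sk_id_tercero == -1L, NA_integer_)
```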
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 33 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | 0; 1 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2773 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | -17243225600.07; 20736458922.66 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 56 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0.05 |
| Min. and max. | 0; 1 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 10166 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 5231052.11 |
| Min. and max. | -9e+08; 5911629988.86 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 2926 (1.47 %) |
| Number of unique values | 11 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0.1 |
| Min. and max. | 0; 0.2 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 16682 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 4104172.73 |
| Min. and max. | -3617808287.34; 3089802283 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1962 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | 0; 515800414.54 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 34442 |
| Median | 14178947.12 |
| 1st and 3rd quartiles | 1140000; 63202053.38 |
| Min. and max. | -2223600323; 9230105669.28 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 23380 |
| Median | 97449.72 |
| 1st and 3rd quartiles | 0; 515580.04 |
| Min. and max. | -422484061.37; 422484061.37 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 40464 |
| Median | 11900000 |
| 1st and 3rd quartiles | 1276800; 50299488.74 |
| Min. and max. | -17243225600.07; 20736458922.66 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 5798 (2.9 %) |
| Number of unique values | 40009 |
| Median | 3.24 |
| 1st and 3rd quartiles | 1; 53 |
| Min. and max. | -1236734210; 6793438200 |
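The extremes of this variable are many orders of magnitude away from its quartiles. As a rough illustration, the sketch below applies Tukey's 1.5 × IQR fences to the quartiles reported above. This is a simple textbook rule chosen for the example and is not necessarily the rule dataMaid uses for its outlier check.

```r
# Quartiles and extremes taken from the table above.
q1 <- 1
q3 <- 53
extremes <- c(min = -1236734210, max = 6793438200)

# Tukey fences: values beyond 1.5 * IQR from the quartiles are flagged.
iqr <- q3 - q1
lower <- q1 - 1.5 * iqr
upper <- q3 + 1.5 * iqr
outside <- extremes < lower | extremes > upper
```

Both extremes fall outside the fences (-77 and 131), so the negative and very large quantities are worth checking against the source records.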
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 5798 (2.9 %) |
| Number of unique values | 50764 |
| Median | 80000 |
| 1st and 3rd quartiles | 14288.4; 740000 |
| Min. and max. | -13240606.52; 30420118402.79 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 5798 (2.9 %) |
| Number of unique values | 30543 |
| Median | 150.65 |
| 1st and 3rd quartiles | 0; 4853.26 |
| Min. and max. | 0; 779596794.94 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 5798 (2.9 %) |
| Number of unique values | 121677 |
| Median | 1045786.02 |
| 1st and 3rd quartiles | 168600; 4588964.56 |
| Min. and max. | -2168581138.44; 7087702769.07 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11407 |
| Median | 2000047 |
| 1st and 3rd quartiles | 1080111; 2280270 |
| Min. and max. | 30001; 2940002 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “ACTAS GRUPOS” |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 385 |
| Median | 6 |
| 1st and 3rd quartiles | 2; 12 |
| Min. and max. | 1; 385 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 335 (0.17 %) |
| Number of unique values | 2 |
| Mode | “0” |
| Reference category | 0 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | 0; 390399.35 |
Report generation information:
Report creation time: Sun Nov 02 2025 14:30:47
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 6808 |
| Number of variables | 12 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 67 | 0.00 % | × |
| SkIdTercero | integer | 328 | 0.00 % | |
| SkIdEspecificacionDeContratos | numeric | 982 | 0.00 % | × |
| SkIdEspecificacionActas | numeric | 2238 | 0.00 % | × |
| SkIdItems | integer | 379 | 0.00 % | × |
| SkIdInsumo | integer | 1387 | 0.00 % | × |
| SkIdFecha | integer | 1354 | 0.00 % | |
| SkIdTipoDescuento | integer | 2 | 0.00 % | |
| Valro.Descuento | numeric | 3403 | 0.00 % | × |
| Cantidad.Descuento | numeric | 1136 | 0.00 % | × |
| Total.Descuento | numeric | 4866 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 67 |
| Median | 100200 |
| 1st and 3rd quartiles | 100108; 100226 |
| Min. and max. | 1003; 100278 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 328 |
| Median | 7292 |
| 1st and 3rd quartiles | 5141; 11938 |
| Min. and max. | 22; 12701 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 982 |
| Median | 1002002000005 |
| 1st and 3rd quartiles | 1001081080029; 1002262260083 |
| Min. and max. | 100330016; 1002782780003 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2238 |
| Median | 10024024000345 |
| 1st and 3rd quartiles | 10011111100059; 100188188000640 |
| Min. and max. | 1003300163; 1002282280489359 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 379 |
| Median | 10067005 |
| 1st and 3rd quartiles | 10027175; 10083085 |
| Min. and max. | 1002553; 100144871 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1387 |
| Median | 1004830.5 |
| 1st and 3rd quartiles | 1001957; 1008966 |
| Min. and max. | 100106; 10019101 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1354 |
| Median | 20191113 |
| 1st and 3rd quartiles | 20160406.75; 20230203 |
| Min. and max. | 20110429; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “1” |
| Reference category | 1 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3403 |
| Median | 40000 |
| 1st and 3rd quartiles | 5000; 375982 |
| Min. and max. | 0; 8434389076 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1136 |
| Median | 4 |
| 1st and 3rd quartiles | 1; 30 |
| Min. and max. | 0; 103757824 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4866 |
| Median | 360000 |
| 1st and 3rd quartiles | 83889.01; 1291389.6 |
| Min. and max. | 0; 224454725.46 |
Report generation information:
Report creation time: Sun Nov 02 2025 14:30:56
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 1461 |
| Number of variables | 11 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 66 | 0.00 % | |
| SkIdTercero | integer | 261 | 0.00 % | × |
| SkIdFechaAnticipo | integer | 916 | 0.00 % | |
| SkIdFechaPago | numeric | 596 | 26.97 % | |
| SkIdUsuario | integer | 36 | 0.00 % | × |
| SkIdEstado | integer | 5 | 0.00 % | × |
| Anticipo.Numero | integer | 1360 | 0.00 % | |
| Porcentaje.Amortizado | numeric | 15 | 0.00 % | × |
| Valor.Anticipo | numeric | 1168 | 0.00 % | × |
| Factura | character | 527 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 66 |
| Median | 100157 |
| 1st and 3rd quartiles | 10031; 100225 |
| Min. and max. | 1003; 100275 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 261 |
| Median | 5061 |
| 1st and 3rd quartiles | 4814; 8624 |
| Min. and max. | 155; 12678 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 916 |
| Median | 20181109 |
| 1st and 3rd quartiles | 20150414; 20230216 |
| Min. and max. | 20110709; 20251027 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 394 (26.97 %) |
| Number of unique values | 595 |
| Median | 20191214 |
| 1st and 3rd quartiles | 20161021; 20231006.5 |
| Min. and max. | 20110728; 20251119 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 36 |
| Median | 100203 |
| 1st and 3rd quartiles | 100149; 100268 |
| Min. and max. | 10070; 100499 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 5 |
| Mode | “10033” |
| Reference category | 10030 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1360 |
| Median | 924 |
| 1st and 3rd quartiles | 331; 1398 |
| Min. and max. | 1; 1790 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 15 |
| Median | 100 |
| 1st and 3rd quartiles | 100; 100 |
| Min. and max. | 0; 100 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1168 |
| Median | 24197068 |
| 1st and 3rd quartiles | 4973442.82; 83664448 |
| Min. and max. | -5e+08; 4724216985 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 527 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
The following values appear with prefixed or suffixed white space: " 02 08 - 2021", "42472 ", "46972 ", "AJUSTE ANTICIPO ", "ajuste anticipo por ", …, "PROFORMA ", "PROFORMA No: 0811 - ", "PROFORMA No: 2402 - ", "PROFORMA No: 2709 - ", "Traslado de menta a " (12 values omitted).
Note that the following levels have at most five observations: " 02 08 - 2021", "0001", "01", "010-2023", "010-23", …, "Traslado Terranum", "TrasladoMenta", "VIENE DE ETAPA 1", "WP982024", "YFA-0105" (516 values omitted).
Note that there might be case problems with the following levels: "Cuenta de Cobro No. ", "CUENTA DE COBRO NO. ".
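The whitespace and case messages above usually point to the same invoice label entered in slightly different ways. A minimal sketch in base R, using four of the values quoted in the messages, trims surrounding whitespace and builds an upper-case comparison key; the variable names are illustrative only.

```r
# Four values quoted in the messages above.
factura <- c("PROFORMA ", " 02 08 - 2021",
             "Cuenta de Cobro No. ", "CUENTA DE COBRO NO. ")

# Remove leading and trailing whitespace.
factura_trim <- trimws(factura)

# Upper-case key, used only for comparing and grouping labels.
factura_key <- toupper(factura_trim)

# Number of distinct labels before and after normalisation.
n_before <- length(unique(factura))
n_after <- length(unique(factura_key))
```

Keeping the trimmed original alongside the key preserves the label as entered while letting the two "Cuenta de Cobro" variants be grouped together.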
Report generation information:
Report creation time: Sun Nov 02 2025 14:31:01
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 46144 |
| Number of variables | 7 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 56 | 0.00 % | × |
| SkIdFecha | integer | 1561 | 0.00 % | |
| SkIdUsuario | integer | 34 | 0.00 % | × |
| No..Contrato | integer | 4989 | 0.00 % | × |
| No..Acta | integer | 385 | 0.00 % | × |
| Descripcion.Estado | character | 7 | 0.00 % | |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 56 |
| Median | 100211 |
| 1st and 3rd quartiles | 100201; 100236 |
| Min. and max. | 10029; 100294 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1561 |
| Median | 20230629 |
| 1st and 3rd quartiles | 20211230; 20240911 |
| Min. and max. | 20191118; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 34 |
| Median | 100187 |
| 1st and 3rd quartiles | 100141; 100324 |
| Min. and max. | 100103; 100512 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4989 |
| Median | 2110065.5 |
| 1st and 3rd quartiles | 2010013; 2360038 |
| Min. and max. | 290185; 2940002 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 385 |
| Median | 4 |
| 1st and 3rd quartiles | 2; 10 |
| Min. and max. | 1; 385 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Mode | “Programación de Actas” |
Report generation information:
Report creation time: Sun Nov 02 2025 14:31:04
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 41595 |
| Number of variables | 6 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 81 | 0.00 % | × |
| SkIdFecha | integer | 3661 | 0.00 % | × |
| SkIdUsuario | integer | 136 | 0.00 % | × |
| No..Contrato | integer | 11950 | 0.00 % | × |
| Descripcion.Estado | character | 10 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 81 |
| Median | 100210 |
| 1st and 3rd quartiles | 100188; 100236 |
| Min. and max. | 1003; 100295 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3661 |
| Median | 20230310 |
| 1st and 3rd quartiles | 20210212; 20240706 |
| Min. and max. | 20110616; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 136 |
| Median | 100210 |
| 1st and 3rd quartiles | 100141; 100324 |
| Min. and max. | 10051; 100514 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11950 |
| Median | 2100212 |
| 1st and 3rd quartiles | 1880148; 2360002 |
| Min. and max. | 30001; 2950003 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 10 |
| Mode | “CREACION” |
Report generation information:
Report creation time: Sun Nov 02 2025 14:31:07
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 31420 |
| Number of variables | 6 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 47 | 0.00 % | × |
| SkIdFecha | integer | 1124 | 0.00 % | |
| SkIdUsuario | integer | 29 | 0.00 % | |
| No..Entrada | integer | 29188 | 0.00 % | × |
| Descripcion.Estado | character | 7 | 0.00 % | |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 47 |
| Median | 100217 |
| 1st and 3rd quartiles | 100204; 100230 |
| Min. and max. | 10029; 100294 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1124 |
| Median | 20230928 |
| 1st and 3rd quartiles | 20220526; 20240903 |
| Min. and max. | 20191118; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 29 |
| Median | 100213 |
| 1st and 3rd quartiles | 100141; 100349 |
| Min. and max. | 100103; 100463 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 29188 |
| Median | 21700788.5 |
| 1st and 3rd quartiles | 20400400; 23000283 |
| Min. and max. | 290587; 29400005 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Mode | “Programación y Aprobación de Entradas de Almacén” |
Report generation information:
Report creation time: Sun Nov 02 2025 14:31:09
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 364087 |
| Number of variables | 8 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 59 | 0.00 % | × |
| SkIdPedido | integer | 55504 | 0.00 % | × |
| SkIdInsumo | integer | 5581 | 0.00 % | × |
| SkIdFecha | integer | 1996 | 0.00 % | × |
| SkIdUsuario | integer | 83 | 0.00 % | × |
| SkIdEstado | integer | 7 | 0.00 % | × |
| EventoPedido | character | 5 | 0.00 % | |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 59 |
| Median | 100217 |
| 1st and 3rd quartiles | 100211; 100239 |
| Min. and max. | 1006; 100295 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 55504 |
| Median | 100117732 |
| 1st and 3rd quartiles | 100102537; 100131545 |
| Min. and max. | 10019331; 100141328 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 5581 |
| Median | 1003590 |
| 1st and 3rd quartiles | 1001985; 10010498 |
| Min. and max. | 100101; 10019323 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1996 |
| Median | 20240221 |
| 1st and 3rd quartiles | 20230118; 20241112 |
| Min. and max. | 20131024; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 83 |
| Median | 100287 |
| 1st and 3rd quartiles | 100230; 100370 |
| Min. and max. | 100103; 100513 |
SkIdEstado
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Median | 10073 |
| 1st and 3rd quartiles | 10073; 10073 |
| Min. and max. | -10075; 10073 |
EventoPedido
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 5 |
| Mode | “Creacion” |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:31:14
Report was run from directory:
D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 128467 |
| Number of variables | 20 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 72 | 0.00 % | × |
| SkIdTercero | integer | 916 | 0.00 % | × |
| SkIdFechaCompra | integer | 3678 | 0.00 % | |
| SkIdFechaEntrega | integer | 4048 | 0.00 % | × |
| SkIdFechaCierre | numeric | 1505 | 70.32 % | |
| SkIdEstado | integer | 7 | 0.00 % | × |
| SkIdInsumo | integer | 7739 | 0.00 % | × |
| SkIdItems | integer | 14591 | 0.00 % | × |
| SkIdUsuario | integer | 45 | 0.00 % | × |
| SkIdOrigenDelDocumento | integer | 4 | 0.00 % | |
| SkIdEstadoEnvioDocumento | integer | 3 | 0.00 % | |
| Compra.No | integer | 28261 | 0.00 % | |
| Cantidad.Comprada | numeric | 21068 | 0.00 % | × |
| Valor.Unitario | numeric | 22720 | 0.00 % | × |
| IVA | numeric | 6 | 0.00 % | |
| Descuento | numeric | 105 | 0.00 % | × |
| Valor.Neto | numeric | 24597 | 0.00 % | × |
| Valor.IVA | numeric | 59543 | 0.00 % | × |
| Valor.Total | numeric | 65267 | 0.00 % | × |
SkIdProyecto
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 72 |
| Median | 100184 |
| 1st and 3rd quartiles | 100108; 100226 |
| Min. and max. | 1003; 100295 |
SkIdTercero
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 916 |
| Median | 5674 |
| 1st and 3rd quartiles | 4463; 8972 |
| Min. and max. | 71; 12696 |
SkIdFechaCompra
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3678 |
| Median | 20190718 |
| 1st and 3rd quartiles | 20151007; 20231023 |
| Min. and max. | 20110616; 20251031 |
SkIdFechaEntrega
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4048 |
| Median | 20190518 |
| 1st and 3rd quartiles | 20150912; 20231003 |
| Min. and max. | 19000101; 20430702 |
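The date keys in these tables look like integers in YYYYMMDD form, so a minimum of 19000101 and a maximum of 20430702 may be placeholder or mistyped dates rather than real delivery dates. The following is a minimal sketch in R, assuming that encoding; the sample vector and the plausible date window are illustrative assumptions, not values prescribed by the report.

```r
# Illustrative sample of integer date keys (assumed YYYYMMDD encoding).
# 19000101 and 20430702 are the extremes reported above; 20230230 is a
# made-up impossible date added to show how parsing failures surface.
date_key <- c(19000101L, 20190518L, 20430702L, 20251031L, 20230230L)

# Parse the keys; dates that do not exist (30 February) become NA.
parsed <- as.Date(as.character(date_key), format = "%Y%m%d")

# Hypothetical plausible window; adjust to the business context.
window_start <- as.Date("2010-01-01")
window_end   <- as.Date("2030-12-31")

# Classify each key as unparseable, outside the window, or ok.
status <- ifelse(is.na(parsed), "unparseable",
          ifelse(parsed < window_start | parsed > window_end,
                 "out_of_window", "ok"))

data.frame(date_key, parsed, status)
```

Rows marked `out_of_window` or `unparseable` are candidates for review, not automatic deletions; a key such as 19000101 may be a deliberate "unknown date" code.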
SkIdFechaCierre
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 90336 (70.32 %) |
| Number of unique values | 1504 |
| Median | 20200904 |
| 1st and 3rd quartiles | 20170124; 20240514 |
| Min. and max. | 20110705; 20251029 |
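The 70.32 % figure follows directly from the counts in this report (90336 missing out of 128467 observations). The overview table lists 1505 unique values for the same variable while the table above lists 1504, which is consistent with the overview counting NA as one extra value. A short sketch in R, using the reported counts and a made-up sample vector, illustrates both points.

```r
# Counts taken from the report.
n_obs     <- 128467L
n_missing <- 90336L

# Share of missing observations, rounded to 2 decimals as in the report.
pct_missing <- round(100 * n_missing / n_obs, 2)
pct_missing

# Made-up sample with missing values, to show the two ways of counting.
x <- c(20200904, NA, 20170124, NA, 20240514)

n_unique_with_na    <- length(unique(x))             # NA counted as a value
n_unique_without_na <- length(unique(x[!is.na(x)]))  # NA excluded
c(n_unique_with_na, n_unique_without_na)
```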
SkIdEstado
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Median | 10023 |
| 1st and 3rd quartiles | 10023; 10024 |
| Min. and max. | 10020; 10027 |
SkIdInsumo
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7739 |
| Median | 1004391 |
| 1st and 3rd quartiles | 1001664; 1009023 |
| Min. and max. | 100101; 10019323 |
SkIdItems
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 14591 |
| Median | 10062373 |
| 1st and 3rd quartiles | 10027093; 10089774 |
| Min. and max. | 100; 100145625 |
SkIdUsuario
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 45 |
| Median | 100230 |
| 1st and 3rd quartiles | 100164; 100287 |
| Min. and max. | 10048; 100499 |
SkIdOrigenDelDocumento
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “4” |
| Reference category | 3 |
SkIdEstadoEnvioDocumento
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “-1” |
| Reference category | -1 |
Compra.No
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 28261 |
| Median | 13400053 |
| 1st and 3rd quartiles | 1150065; 21700588 |
| Min. and max. | 30001; 29500001 |
Cantidad.Comprada
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 21068 |
| Median | 10 |
| 1st and 3rd quartiles | 2; 80 |
| Min. and max. | 0; 1811700.43 |
Valor.Unitario
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 22720 |
| Median | 11623 |
| 1st and 3rd quartiles | 3150; 59980 |
| Min. and max. | 0; 1087483487 |
IVA
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6 |
| Median | 0.19 |
| 1st and 3rd quartiles | 0.16; 0.19 |
| Min. and max. | 0; 0.19 |
Descuento
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 105 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | 0; 1 |
Valor.Neto
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 24597 |
| Median | 11400 |
| 1st and 3rd quartiles | 3076.64; 58805 |
| Min. and max. | 0; 1087483487 |
Valor.IVA
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 59543 |
| Median | 27184.25 |
| 1st and 3rd quartiles | 3192; 199972.98 |
| Min. and max. | 0; 1170358477.78 |
Valor.Total
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 65267 |
| Median | 237800 |
| 1st and 3rd quartiles | 37096.8; 1527138.9 |
| Min. and max. | 0; 7330139939.78 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:31:21
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 143120 |
| Number of variables | 20 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| SkIdProyecto | integer | 81 | 0.00 % | × |
| SkIdTercero | integer | 1632 | 0.00 % | |
| SkIdInsumo | integer | 6046 | 0.00 % | × |
| SkIdItems | integer | 26662 | 0.00 % | × |
| SkIdTipoContrato | integer | 53 | 0.00 % | × |
| SKIdEstado | integer | 4 | 0.00 % | |
| SkIdVariablesAdicionalesContratos | integer | 11916 | 0.00 % | × |
| SkIdEspecificacionDeContratos | numeric | 11916 | 0.00 % | × |
| Cantidad.Inicial | numeric | 13664 | 0.00 % | × |
| Cantidad | numeric | 21067 | 0.00 % | × |
| Valor.Unitario | numeric | 57008 | 0.00 % | × |
| Valor.Iva | numeric | 36717 | 0.00 % | × |
| Valor.Total | numeric | 79947 | 0.00 % | × |
| Valor.Contrato.Sin.IVA | numeric | 6541 | 31.10 % | × |
| Valor.Contrato | numeric | 9023 | 1.99 % | × |
| Numero.De.Grupo | integer | 470 | 0.00 % | × |
| Valor.Detalle | numeric | 69048 | 0.00 % | × |
| Valor.Detalle.Unitario | numeric | 60252 | 0.00 % | × |
SkIdProyecto
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 81 |
| Median | 100200 |
| 1st and 3rd quartiles | 100111; 100228 |
| Min. and max. | 1003; 100295 |
SkIdTercero
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1632 |
| Median | 9058 |
| 1st and 3rd quartiles | 5082; 11778 |
| Min. and max. | 1; 12708 |
SkIdInsumo
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6046 |
| Median | 1006715 |
| 1st and 3rd quartiles | 1002351; 1009361 |
| Min. and max. | 100; 10019348 |
SkIdItems
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 26662 |
| Median | 10067747 |
| 1st and 3rd quartiles | 10028783; 10082227 |
| Min. and max. | 100; 100145469 |
SkIdTipoContrato
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 53 |
| Median | 25 |
| 1st and 3rd quartiles | 22; 47 |
| Min. and max. | 2; 63 |
SKIdEstado
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “10011” |
| Reference category | 10010 |
SkIdVariablesAdicionalesContratos
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11916 |
| Median | 1002000103 |
| 1st and 3rd quartiles | 1001110044; 1002280082 |
| Min. and max. | 10030001; 1002950003 |
SkIdEspecificacionDeContratos
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11916 |
| Median | 1002002000103 |
| 1st and 3rd quartiles | 1001111110044; 1002282280082 |
| Min. and max. | 100330001; 1002952950003 |
Cantidad.Inicial
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 13664 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 4 |
| Min. and max. | 0; 28572949385.16 |
Cantidad
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 21067 |
| Median | 1 |
| 1st and 3rd quartiles | 1; 23.6 |
| Min. and max. | -1236734210; 16209633178.65 |
Valor.Unitario
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 57008 |
| Median | 99365 |
| 1st and 3rd quartiles | 18228.6; 981979.19 |
| Min. and max. | -13240606.52; 30420118402.79 |
Valor.Iva
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 36717 |
| Median | 107.22 |
| 1st and 3rd quartiles | 0; 5179.91 |
| Min. and max. | 0; 779596794.94 |
Valor.Total
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 79947 |
| Median | 665512.47 |
| 1st and 3rd quartiles | 71400; 3761120.75 |
| Min. and max. | -1236734210; 16981211717.95 |
Valor.Contrato.Sin.IVA
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 44517 (31.1 %) |
| Number of unique values | 6540 |
| Median | 90213429.44 |
| 1st and 3rd quartiles | 19747220.84; 503972858.67 |
| Min. and max. | -0.03; 117924027174.96 |
Valor.Contrato
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 2848 (1.99 %) |
| Number of unique values | 9022 |
| Median | 50266796.39 |
| 1st and 3rd quartiles | 1e+07; 4.5e+08 |
| Min. and max. | -1193561299; 118785779681.24 |
Numero.De.Grupo
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 470 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 14 |
| Min. and max. | 0; 469 |
Valor.Detalle
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 69048 |
| Median | 160000 |
| 1st and 3rd quartiles | 27000; 1285931.53 |
| Min. and max. | -1236734210; 30539808138.04 |
Valor.Detalle.Unitario
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 60252 |
| Median | 96158.98 |
| 1st and 3rd quartiles | 17900; 901324 |
| Min. and max. | -13240606.52; 30539808138.04 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:31:31
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 5770 |
| Number of variables | 11 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 47 | 0.00 % | × |
| SkIdContrato | numeric | 1167 | 0.00 % | × |
| SkIdTercero | integer | 341 | 0.00 % | |
| SkIdEstado | integer | 3 | 0.00 % | |
| SkIdFechaVigenciaDesde | integer | 1075 | 0.00 % | × |
| SkIdFechaVigenciaHasta | integer | 2280 | 0.00 % | × |
| SkIdTipoPoliza | integer | 11 | 0.00 % | |
| PolizaNumero | character | 2315 | 0.00 % | × |
| ValorAsegurado | numeric | 3832 | 0.00 % | × |
| PorcentajeAsegurado | numeric | 432 | 0.00 % | × |
SkIdProyecto
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 47 |
| Median | 100210 |
| 1st and 3rd quartiles | 100157; 100230 |
| Min. and max. | 1003; 100295 |
SkIdContrato
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1167 |
| Median | 1002102100156 |
| 1st and 3rd quartiles | 1001571570009; 1002302300247.75 |
| Min. and max. | 100330001; 1002952950001 |
SkIdTercero
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 341 |
| Median | 8434 |
| 1st and 3rd quartiles | 5279; 11914 |
| Min. and max. | 22; 12704 |
SkIdEstado
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “100111” |
| Reference category | -100111 |
SkIdFechaVigenciaDesde
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1075 |
| Median | 20230115 |
| 1st and 3rd quartiles | 20181115; 20240701 |
| Min. and max. | 19230821; 20270117 |
SkIdFechaVigenciaHasta
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2280 |
| Median | 20240531 |
| 1st and 3rd quartiles | 20191219.25; 20260612 |
| Min. and max. | 20110331; 20311027 |
SkIdTipoPoliza
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11 |
| Median | 1004 |
| 1st and 3rd quartiles | 1002; 10023 |
| Min. and max. | 1001; 10039 |
PolizaNumero
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2315 |
| Mode | “1000171371001” |
The following values appear with prefixed or suffixed white space: " 21-40-101177568", " 3780475-1", " BQ-100063888", " NB-100049724", " NB-100247249", "65-54-101006364 ", "CBO-100015876 ", "SEPL-23572249-1 ".
Note that the following levels have at most five observations: " 21-40-101177568", " 3780475-1", " BQ-100063888", " NB-100049724", " NB-100247249", …, "NB 100387945", "No 3120402", "SEPL-23572249-1", "SEPL-23572249-1 ", "SEPL10689104-1" (2257 values omitted).
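The whitespace note above matters because a padded value and its trimmed twin are treated as different levels: both "SEPL-23572249-1" and "SEPL-23572249-1 " appear in the list of rare levels. A minimal sketch in R of how trimming could merge them, using a few of the reported values as sample data (the variable names are illustrative):

```r
# Sample values copied from the notes above, including one padded/trimmed pair.
policy_raw <- c(" 21-40-101177568", "65-54-101006364 ",
                "SEPL-23572249-1 ", "SEPL-23572249-1")

# trimws() (base R) strips leading and trailing whitespace.
policy_clean <- trimws(policy_raw)

# Entries that changed are the ones carrying stray whitespace.
changed <- policy_raw != policy_clean

# Distinct values before and after trimming.
n_before <- length(unique(policy_raw))
n_after  <- length(unique(policy_clean))
c(before = n_before, after = n_after)
```

In this sample the two SEPL entries collapse into a single level after trimming, so the number of distinct values drops by one.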
ValorAsegurado
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3832 |
| Median | 17918540.1 |
| 1st and 3rd quartiles | 4663721.85; 69541714.7 |
| Min. and max. | 0; 54177206177.78 |
PorcentajeAsegurado
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 432 |
| Median | 20 |
| 1st and 3rd quartiles | 19.54; 30 |
| Min. and max. | 0; 205560.16 |
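The last variable summarized above (PorcentajeAsegurado in the overview table) has a median of 20 and quartiles of 19.54 and 30, yet a maximum of 205560.16. If the column is meant to hold a percentage between 0 and 100, which is an assumption the report does not confirm, that maximum deserves review. The sketch below applies two simple screens in R: a fixed 0 to 100 range and Tukey's 1.5 × IQR fences computed from the reported quartiles. Tukey's rule is used here only for illustration and is not necessarily the rule dataMaid applies.

```r
# Values taken from the summary table above (min, Q1, median, Q3, max).
pct <- c(0, 19.54, 20, 30, 205560.16)

# Screen 1: assumed valid range for a percentage.
out_of_range <- pct < 0 | pct > 100
pct[out_of_range]

# Screen 2: Tukey fences from the reported quartiles.
q1  <- 19.54
q3  <- 30
iqr <- q3 - q1
lower_fence <- q1 - 1.5 * iqr   # about 3.85
upper_fence <- q3 + 1.5 * iqr   # about 45.69

outside_fences <- pct < lower_fence | pct > upper_fence
pct[outside_fences]
```

The two screens disagree on the value 0: it is inside the assumed valid range but below the lower fence, which is a reminder that a statistical flag is a prompt for inspection rather than proof of an error.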
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:31:35
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 1632800 |
| Number of variables | 13 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| SkIdProyecto | integer | 88 | 0.00 % | × |
| SkIdFecha | integer | 4579 | 0.00 % | × |
| SkIdClaseOrigen | integer | 22 | 0.00 % | × |
| SkIdInsumo | integer | 13322 | 0.00 % | × |
| SkIdCapitulo | numeric | 1996 | 0.00 % | |
| SkIdItems | numeric | 47601 | 0.82 % | × |
| Cantidad | numeric | 226445 | 0.00 % | × |
| Valor.Total | numeric | 579815 | 0.00 % | × |
| Origen.Documento | numeric | 397538 | 0.10 % | × |
| Origen.Documento.Detalle | integer | 7394 | 0.00 % | × |
| Valor.Sin.IVA | numeric | 439582 | 13.11 % | × |
SkIdProyecto
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 88 |
| Median | 100174 |
| 1st and 3rd quartiles | 100110; 100226 |
| Min. and max. | 1003; 100295 |
SkIdFecha
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4579 |
| Median | 20180831 |
| 1st and 3rd quartiles | 20140416; 20230502 |
| Min. and max. | 19000101; 20260201 |
SkIdClaseOrigen
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 22 |
| Median | 13 |
| 1st and 3rd quartiles | 10; 29 |
| Min. and max. | 1; 33 |
SkIdInsumo
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 13322 |
| Median | 1005402 |
| 1st and 3rd quartiles | 1001985; 1008730 |
| Min. and max. | 100101; 10019356 |
SkIdCapitulo
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 38 (0 %) |
| Number of unique values | 1995 |
| Median | 1001712228 |
| 1st and 3rd quartiles | 100351175; 1002263096 |
| Min. and max. | -910024120; 1002954543 |
SkIdItems
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 13438 (0.82 %) |
| Number of unique values | 47600 |
| Median | 10064713 |
| 1st and 3rd quartiles | 10028648; 10085206 |
| Min. and max. | 100; 100145937 |
Cantidad
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 35 (0 %) |
| Number of unique values | 226444 |
| Median | 4 |
| 1st and 3rd quartiles | 1; 49 |
| Min. and max. | -8.145313e+13; 8.145324e+13 |
Valor.Total
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 579815 |
| Median | 288494.73 |
| 1st and 3rd quartiles | 24584.4; 2133066.75 |
| Min. and max. | -521909894612433; 521909894612433 |
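The minimum and maximum in the table above are exact mirror images (-521909894612433 and 521909894612433). One possible reading, offered here only as a hypothesis, is that an erroneous entry was later cancelled by an opposite-signed entry. A minimal sketch in R of how such mirrored values could be listed for review, using the reported extremes and quartiles as sample data:

```r
# Sample built from the summary above: both extremes, the quartiles and the median.
valor_total <- c(-521909894612433, 24584.4, 288494.73,
                 2133066.75, 521909894612433)

# A value has a mirror when its negation is also present (zeros are ignored).
has_mirror <- valor_total != 0 & (-valor_total) %in% valor_total

# Candidate pairs to inspect against the source documents.
valor_total[has_mirror]
```

Matching on the value alone is a coarse first pass; on the real data it would be safer to also require the same project and item keys before treating two rows as a pair.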
Origen.Documento
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 1653 (0.1 %) |
| Number of unique values | 397537 |
| Median | 118986 |
| 1st and 3rd quartiles | 27984; 243217 |
| Min. and max. | -1; 174000235 |
Origen.Documento.Detalle
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7394 |
| Median | -1 |
| 1st and 3rd quartiles | -1; -1 |
| Min. and max. | -1; 86399 |
Valor.Sin.IVA
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 213986 (13.11 %) |
| Number of unique values | 439581 |
| Median | 326408.65 |
| 1st and 3rd quartiles | 36100; 2236405.88 |
| Min. and max. | -443849906571097; 443849906571097 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:32:38
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 1008 |
| Number of variables | 13 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 56 | 0.00 % | × |
| SkIdTercero | integer | 107 | 0.00 % | |
| SkIdInsumo | integer | 569 | 0.00 % | × |
| SkIdFecha | integer | 377 | 0.00 % | |
| SkIdEstado | integer | 4 | 0.00 % | × |
| SkIdBodega | integer | 54 | 0.00 % | × |
| Devolucion.Numero | integer | 549 | 0.00 % | × |
| Remision | character | 331 | 0.00 % | × |
| Total | numeric | 847 | 0.00 % | × |
| Devolucion.Factura | character | 175 | 0.00 % | × |
| Cantidad.Devuelta | integer | 181 | 0.00 % | × |
| Compra.No | integer | 415 | 0.00 % | × |
SkIdProyecto
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 56 |
| Median | 100210 |
| 1st and 3rd quartiles | 100167; 100226 |
| Min. and max. | 1005; 100277 |
SkIdTercero
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 107 |
| Median | 5163 |
| 1st and 3rd quartiles | 2095; 8603 |
| Min. and max. | 1919; 12678 |
SkIdInsumo
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 569 |
| Median | 1005183.5 |
| 1st and 3rd quartiles | 1002094.75; 10010618 |
| Min. and max. | 100114; 10018213 |
SkIdFecha
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 377 |
| Median | 20220616 |
| 1st and 3rd quartiles | 20191128; 20240327 |
| Min. and max. | 20130812; 20251029 |
SkIdEstado
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “10053” |
| Reference category | 10050 |
SkIdBodega
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 54 |
| Median | 1000204 |
| 1st and 3rd quartiles | 1000163; 1000225 |
| Min. and max. | 100; 1000277 |
Devolucion.Numero
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 549 |
| Median | 20400012.5 |
| 1st and 3rd quartiles | 1840007.75; 22500001 |
| Min. and max. | 50014; 27700001 |
Remision
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 331 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "-00439194", "00000000000965", "000288406", "001", "001-2", …, "RM13015118", "RM13015812", "rm48-694", "Vales: 178, 187, 188, 177 - Arena y grava para cic", "xxxx" (312 values omitted).
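Because the empty string is stored as a regular value, the table above reports 0 missing observations even though "" is the most frequent value. A minimal sketch in R of recoding empty or whitespace-only strings to NA before counting; the sample vector mixes a few reported levels with empty strings and is illustrative only:

```r
# Illustrative sample: "" is the reported mode, the other values are
# levels listed in the note above.
remision <- c("", "RM13015118", "xxxx", "", "001")

# Recode empty or whitespace-only strings as missing.
remision_na <- remision
remision_na[trimws(remision_na) == ""] <- NA_character_

# Missing count and share after recoding.
n_missing   <- sum(is.na(remision_na))
pct_missing <- 100 * mean(is.na(remision_na))
c(n_missing = n_missing, pct_missing = pct_missing)
```

Whether placeholders such as "xxxx" should also be treated as missing is a domain decision; the sketch leaves them untouched.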
Total
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 847 |
| Median | 277830 |
| 1st and 3rd quartiles | 50654.38; 1586548.91 |
| Min. and max. | 0; 111091518.6 |
Devolucion.Factura
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 175 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "000232", "001", "001-00000047880", "0036", "02", …, "NCSK990010781", "NCV23", "NCV24", "SALIDA POR AJUSTE", "x" (157 values omitted).
Cantidad.Devuelta
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 181 |
| Median | 10 |
| 1st and 3rd quartiles | 2; 55 |
| Min. and max. | 0; 20000 |
Compra.No
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 415 |
| Median | 20400226 |
| 1st and 3rd quartiles | 15700390.75; 22500006 |
| Min. and max. | 51474; 27700013 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:32:42
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 905 |
| Number of variables | 12 |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 4 | 0.00 % | × |
| SkIdFecha | integer | 19 | 0.00 % | × |
| SkIdItems | integer | 238 | 0.00 % | × |
| SkIdCapitulo | integer | 13 | 0.00 % | × |
| SkIdEstado | integer | 1 | 0.00 % | × |
| SkIdEspecificacionEjecucionCliente | numeric | 28 | 0.00 % | × |
| Valor.Garantia | numeric | 17 | 0.00 % | |
| Valor.Amortizacion | numeric | 8 | 0.00 % | |
| Cantidad.Ejecucion.Cliente | numeric | 592 | 0.00 % | × |
| Valor.Unitario | numeric | 185 | 0.00 % | × |
| Valor.Total | numeric | 737 | 0.00 % | × |
SkIdProyecto
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4 |
| Mode | “100188” |
| Reference category | 1006 |
SkIdFecha
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 19 |
| Median | 20200519 |
| 1st and 3rd quartiles | 20191105; 20201013 |
| Min. and max. | 20120216; 20250113 |
SkIdItems
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 238 |
| Median | 10061380 |
| 1st and 3rd quartiles | 10061316; 10064802 |
| Min. and max. | 1006736; 100113091 |
SkIdCapitulo
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 13 |
| Median | 1001882511 |
| 1st and 3rd quartiles | 1001882510; 1001882512 |
| Min. and max. | 1006157; 1002753954 |
SkIdEspecificacionEjecucionCliente
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 28 |
| Median | 10018813 |
| 1st and 3rd quartiles | 1001887; 10018818 |
| Min. and max. | 10061; 10027527500003 |
Valor.Garantia
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 17 |
| Median | 51796257.1 |
| 1st and 3rd quartiles | 0; 158295413.8 |
| Min. and max. | 0; 197374360.4 |
Valor.Amortizacion
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 8 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 76392772 |
| Min. and max. | 0; 354584821 |
Cantidad.Ejecucion.Cliente
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 592 |
| Median | 45.73 |
| 1st and 3rd quartiles | 2; 383.68 |
| Min. and max. | -7001.4; 141305.29 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 185 |
| Median | 75900 |
| 1st and 3rd quartiles | 8100; 1041389 |
| Min. and max. | 0; 1.4e+09 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 737 |
| Median | 5862311.13 |
| 1st and 3rd quartiles | 1512001.89; 24857044.48 |
| Min. and max. | -188593765.29; 1.4e+09 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:32:47
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
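The recorded call suggests that the reports in this document were generated one dataset at a time, with `nombre` holding the dataset name. A minimal sketch of such a driver follows, assuming a named list of data frames called `datasets` and two helper names (`report_path`, `make_all_reports`) that are hypothetical and do not appear in the source. It spells out `TRUE` and `FALSE` instead of the shorthands `T` and `F`, which are ordinary variables in R and can be reassigned.

```r
# Hypothetical helper: builds the output path used in the recorded call.
report_path <- function(nombre, out_dir = "20251102") {
  paste0(out_dir, "/reporte_", nombre, ".html")
}

# Hypothetical driver: `datasets` is assumed to be a named list of data frames.
# It requires the dataMaid package and is only defined here, not executed.
make_all_reports <- function(datasets) {
  for (nombre in names(datasets)) {
    df <- datasets[[nombre]]
    dataMaid::makeDataReport(
      data = df,
      output = "html",
      file = report_path(nombre),
      replace = TRUE,      # written as `replace = T` in the recorded call
      openResult = FALSE   # written as `openResult = F` in the recorded call
    )
  }
  invisible(names(datasets))
}

# The path helper can be checked without dataMaid installed.
report_path("ejemplo")
```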
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 2712 |
| Number of variables | 11 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 10 | 0.00 % | |
| SkIdItems | integer | 1235 | 0.00 % | × |
| SkIdCapitulo | integer | 94 | 0.00 % | |
| SkIdFecha | integer | 51 | 0.00 % | × |
| SkIdEstado | integer | 1 | 0.00 % | × |
| Numero.Ejecucion | integer | 48 | 0.00 % | × |
| Cantidad.Ejecucion | numeric | 1317 | 0.00 % | × |
| Valor.Unitario | numeric | 965 | 0.00 % | × |
| Valor.Unitario.Presupuesto | numeric | 900 | 0.00 % | × |
| Valor.Total | numeric | 1943 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 10 |
| Median | 100188 |
| 1st and 3rd quartiles | 10028; 100188 |
| Min. and max. | 1003; 100275 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1235 |
| Median | 10061361 |
| 1st and 3rd quartiles | 10015697.75; 10062381 |
| Min. and max. | 1002462; 100124924 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 94 |
| Median | 1001882510 |
| 1st and 3rd quartiles | 10028579; 1001882521 |
| Min. and max. | 100346; 1002753954 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 51 |
| Median | 20191126 |
| 1st and 3rd quartiles | 20131203; 20200908 |
| Min. and max. | 20111021; 20250128 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 48 |
| Median | 14 |
| 1st and 3rd quartiles | 3; 31 |
| Min. and max. | 1; 27500007 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1317 |
| Median | 8 |
| 1st and 3rd quartiles | 1; 157.32 |
| Min. and max. | -32013; 272810.2 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 965 |
| Median | 159399.95 |
| 1st and 3rd quartiles | 26855.12; 1394000 |
| Min. and max. | 0; 1.4e+09 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 900 |
| Median | 174239.44 |
| 1st and 3rd quartiles | 27999.99; 1461403 |
| Min. and max. | 0; 1317647059 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1943 |
| Median | 4110720 |
| 1st and 3rd quartiles | 685120; 14916988 |
| Min. and max. | -188593765.29; 1070372500000 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:32:53
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 120665 |
| Number of variables | 16 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 68 | 0.00 % | |
| SkIdFechaCompra | integer | 3483 | 0.00 % | |
| SkIdFechaEntrada | integer | 3895 | 0.00 % | |
| SkIdTercero | integer | 750 | 0.00 % | × |
| SkIdInsumo | integer | 6858 | 0.00 % | × |
| SkIdBodega | integer | 68 | 0.00 % | |
| SkIdEstadoPorDocumento | integer | 5 | 0.00 % | × |
| SkIdEspecificacionEntradasAlmacen | numeric | 57476 | 0.00 % | |
| Total.Entrada | numeric | 58481 | 0.00 % | × |
| Compra.Numero | integer | 23185 | 0.00 % | × |
| Compra.Total.Pagar | numeric | 19682 | 0.00 % | × |
| Entrada.Valor.Iva | numeric | 54069 | 0.00 % | × |
| Entrada.Valor.Sin.Iva | numeric | 53998 | 0.00 % | × |
| Entrada.Cantidad | numeric | 12348 | 0.00 % | × |
| Entrada.Valor.Amortizado | numeric | 8057 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 68 |
| Median | 100157 |
| 1st and 3rd quartiles | 10035; 100217 |
| Min. and max. | 1003; 100294 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3483 |
| Median | 20190517 |
| 1st and 3rd quartiles | 20150119; 20230629 |
| Min. and max. | 20110616; 20251030 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3895 |
| Median | 20190628 |
| 1st and 3rd quartiles | 20150217; 20230817 |
| Min. and max. | 20110630; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 750 |
| Median | 5485 |
| 1st and 3rd quartiles | 4553; 8489 |
| Min. and max. | 71; 12696 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6858 |
| Median | 1003692 |
| 1st and 3rd quartiles | 1001775; 1008226 |
| Min. and max. | 100101; 10019275 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 68 |
| Median | 1000157 |
| 1st and 3rd quartiles | 100035; 1000217 |
| Min. and max. | 10003; 1000294 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 5 |
| Mode | “100134” |
| Reference category | 100130 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 57476 |
| Median | 1001571570761 |
| 1st and 3rd quartiles | 10035350079; 10021721700238 |
| Min. and max. | 100330001; 10029429400005 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 58481 |
| Median | 588874 |
| 1st and 3rd quartiles | 79642; 3036581.67 |
| Min. and max. | 0; 3057433903.72 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 23185 |
| Median | 15700060 |
| 1st and 3rd quartiles | 350599; 21700041 |
| Min. and max. | 30001; 29400001 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 19682 |
| Median | 9737910 |
| 1st and 3rd quartiles | 1276000; 89250000 |
| Min. and max. | -20230; 8922395811.7 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 54069 |
| Median | 60000 |
| 1st and 3rd quartiles | 6650; 424870.4 |
| Min. and max. | 0; 488161715.72 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 53998 |
| Median | 504400 |
| 1st and 3rd quartiles | 68100; 2589202 |
| Min. and max. | 0; 2569272188 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12348 |
| Median | 12 |
| 1st and 3rd quartiles | 4; 100 |
| Min. and max. | 0; 95943 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 8057 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | 0; 826031795 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:33:02
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 335598 |
| Number of variables | 13 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| SkIdFecha | integer | 4203 | 0.00 % | |
| SkIdProyecto | integer | 71 | 0.00 % | |
| SkIdInsumo | integer | 7382 | 0.00 % | × |
| Tipo | character | 9 | 0.00 % | |
| Documento | integer | 69753 | 0.00 % | |
| Bodega | integer | 1 | 0.00 % | × |
| Cantidad | numeric | 30147 | 0.00 % | × |
| Unitario.Neto | numeric | 53758 | 0.00 % | × |
| Valor.Iva | numeric | 58609 | 0.00 % | × |
| Unitario | numeric | 53758 | 0.00 % | × |
| Total | numeric | 155049 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 4203 |
| Median | 20190226 |
| 1st and 3rd quartiles | 20150417; 20230529 |
| Min. and max. | 20110630; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 71 |
| Median | 100167 |
| 1st and 3rd quartiles | 10035; 100226 |
| Min. and max. | 1003; 100294 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7382 |
| Median | 1003966 |
| 1st and 3rd quartiles | 1001614; 1008512 |
| Min. and max. | 100101; 10019275 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 9 |
| Mode | “SA” |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 69753 |
| Median | 1880865 |
| 1st and 3rd quartiles | 310563.25; 21000541 |
| Min. and max. | -10; 174000235 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 30147 |
| Median | -1 |
| 1st and 3rd quartiles | -10; 7 |
| Min. and max. | -743487.2; 272825 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 53758 |
| Median | 10115 |
| 1st and 3rd quartiles | 2813.61; 48000 |
| Min. and max. | -1160; 1087483487 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 58609 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 17920 |
| Min. and max. | 0; 488161715.72 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 53758 |
| Median | 10115 |
| 1st and 3rd quartiles | 2813.61; 48000 |
| Min. and max. | -1160; 1087483487 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 155049 |
| Median | -7638.18 |
| 1st and 3rd quartiles | -170836.4; 220000 |
| Min. and max. | -3627960661.17; 3057433903.72 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:33:17
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 220 |
| Number of variables | 9 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdTercero | integer | 33 | 0.00 % | × |
| SkIdProyecto | integer | 36 | 0.00 % | × |
| SkIdFecha | integer | 110 | 0.00 % | × |
| SkIdInsumo | integer | 75 | 0.00 % | × |
| SkIdEstado | integer | 3 | 0.00 % | × |
| Nota.Numero | integer | 143 | 0.00 % | |
| Total.devolucion | numeric | 179 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 33 |
| Median | 4814 |
| 1st and 3rd quartiles | 4814; 8489 |
| Min. and max. | 1920; 12296 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 36 |
| Median | 100135 |
| 1st and 3rd quartiles | 100118; 100228 |
| Min. and max. | 1003; 100275 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 110 |
| Median | 20180757 |
| 1st and 3rd quartiles | 20171019; 20200109 |
| Min. and max. | 20130401; 20251009 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 75 |
| Median | 1003477 |
| 1st and 3rd quartiles | 1001983; 10012089.5 |
| Min. and max. | 100143; 10018010 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “10043” |
| Reference category | 10040 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 143 |
| Median | 120.5 |
| 1st and 3rd quartiles | 89.75; 16700003.25 |
| Min. and max. | 3; 27500002 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 179 |
| Median | -922822.54 |
| 1st and 3rd quartiles | -2402847; -205490.75 |
| Min. and max. | -36534714; 1118525 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:33:19
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 100732 |
| Number of variables | 10 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 59 | 0.00 % | × |
| SkIdCapitulo | integer | 597 | 0.00 % | × |
| SkIdPedido | integer | 55504 | 0.00 % | × |
| SkIdItems | integer | 7313 | 0.00 % | × |
| SkIdInsumo | integer | 5581 | 0.00 % | × |
| SkIdFechaPedido | integer | 1866 | 0.00 % | × |
| SkIdFechaRequerido | integer | 2043 | 0.00 % | × |
| SkIdEstado | integer | 7 | 0.00 % | × |
| Cantidad | numeric | 18224 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 59 |
| Median | 100217 |
| 1st and 3rd quartiles | 100211; 100239 |
| Min. and max. | 1006; 100295 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 597 |
| Median | 1002173804 |
| 1st and 3rd quartiles | 1002112832; 1002393392 |
| Min. and max. | 1006166; 1002954538 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 55504 |
| Median | 100118050 |
| 1st and 3rd quartiles | 100103631.75; 100132065 |
| Min. and max. | 10019331; 100141328 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7313 |
| Median | 10097995 |
| 1st and 3rd quartiles | 10079534; 100108006 |
| Min. and max. | 1006814; 100144963 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 5581 |
| Median | 1003189 |
| 1st and 3rd quartiles | 1001985; 10010498 |
| Min. and max. | 100101; 10019323 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1866 |
| Median | 20240220 |
| 1st and 3rd quartiles | 20230214; 20240511 |
| Min. and max. | 20131024; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2043 |
| Median | 20240221 |
| 1st and 3rd quartiles | 20230214; 20240514 |
| Min. and max. | 19000101; 20430621 |
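The key summarized directly above has 2043 unique values, which matches `SkIdFechaRequerido` in the summary table. Its values look like dates packed as `YYYYMMDD` integers; that reading is an assumption, since the report treats them as plain integers. Under it, the minimum 19000101 looks like a placeholder date and the maximum 20430621 lies well after the report date. A minimal base R sketch for flagging such keys, with hypothetical helper names and a hypothetical lower bound:

```r
# Parse integer keys assumed to be YYYYMMDD; impossible dates become NA.
parse_date_key <- function(key) {
  as.Date(sprintf("%08d", as.integer(key)), format = "%Y%m%d")
}

# Flag keys that fail to parse or fall outside [lower, upper].
flag_date_key <- function(key, lower, upper) {
  d <- parse_date_key(key)
  is.na(d) | d < lower | d > upper
}

# Minimum, median and maximum copied from the table above.
keys <- c(19000101L, 20240221L, 20430621L)
flags <- flag_date_key(
  keys,
  lower = as.Date("2010-01-01"),  # hypothetical lower bound
  upper = as.Date("2025-11-02")   # date on which the report was created
)
flags
```

Medians and quartiles of such keys are computed on the integers, so they need not be valid dates themselves; `parse_date_key()` returns `NA` for any value that is not a real calendar date.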
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 7 |
| Median | 10073 |
| 1st and 3rd quartiles | 10073; 10073 |
| Min. and max. | -10075; 10073 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 18224 |
| Median | 6 |
| 1st and 3rd quartiles | 1; 50 |
| Min. and max. | 0; 1821837.73 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:33:25
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 0 |
| Number of variables | 12 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | logical | 0 | NaN % | × |
| SkIdZona | logical | 0 | NaN % | × |
| SkIdInsumo | logical | 0 | NaN % | × |
| SkIdTercero | logical | 0 | NaN % | × |
| SkIdFechaCotizacion | logical | 0 | NaN % | × |
| SkIdFechaVigencia | logical | 0 | NaN % | × |
| ValorSinIVA | logical | 0 | NaN % | × |
| PorcentajeDescuento | logical | 0 | NaN % | × |
| IVA | logical | 0 | NaN % | × |
| CantidadMinima | logical | 0 | NaN % | × |
| DiasMaxmoParaEntrega | logical | 0 | NaN % | × |
| ProveedorPrincipal | logical | 0 | NaN % | × |
All 12 variables produce the same two messages:
The variable is a key (distinct values for each observation).
The variable only takes one value: "NA".
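The messages above come from a dataset with zero observations, so they are most likely side effects of the empty input and not findings about the data. The base R sketch below reproduces the mechanics on an empty logical vector, which stands in for a column of a table that returned no rows (a reconstruction, since the source data is not available here). The guard `has_rows` is a hypothetical helper for skipping such tables before reporting.

```r
# An empty column, standing in for a query result with no rows.
x <- logical(0)

n_missing <- sum(is.na(x))                   # no values, so no missing values
pct_missing <- 100 * n_missing / length(x)   # 0 / 0 gives NaN
no_duplicates <- !any(duplicated(x))         # trivially TRUE, hence "a key"

# Hypothetical guard: only report on data frames that contain rows.
has_rows <- function(df) nrow(df) > 0
has_rows(data.frame(SkIdEmpresa = x))
```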
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:33:30
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 421 |
| Number of variables | 7 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 1 | 0.00 % | × |
| SkIdActividad | numeric | 419 | 0.00 % | × |
| SkIdFechaInicial | integer | 132 | 0.00 % | |
| SkIdFechaFinal | integer | 146 | 0.00 % | |
| Duracion | integer | 66 | 0.00 % | × |
| PorcentajeAsignado | numeric | 13 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 419 |
| Median | 1004355731714487 |
| 1st and 3rd quartiles | 1002050842307332; 1006724863852852 |
| Min. and max. | 1004758460013; 1008861420467332 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 132 |
| Median | 20260530 |
| 1st and 3rd quartiles | 20250617; 20270104 |
| Min. and max. | 20241202; 20270626 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 146 |
| Median | 20260815 |
| 1st and 3rd quartiles | 20250830; 20270213 |
| Min. and max. | 20241202; 20270803 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 66 |
| Median | 10 |
| 1st and 3rd quartiles | 4; 53 |
| Min. and max. | 0; 766 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 13 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 0 |
| Min. and max. | 0; 1 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: Sun Nov 02 2025 14:33:31
Report was run from directory: D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 275143 |
| Number of variables | 19 |
The following variable checks were performed, depending on the data type of each variable:
| Check | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 77 | 0.00 % | |
| SkIdCapitulo | integer | 1389 | 0.00 % | |
| SkIdItems | integer | 34007 | 0.00 % | × |
| SkIdInsumo | integer | 11238 | 0.00 % | × |
| SkIdReforma | logical | 1 | 100.00 % | × |
| SkIdUsuario | integer | 27 | 0.00 % | × |
| SkIdFecha | integer | 2785 | 0.00 % | × |
| SkIdFecha.Real | integer | 2803 | 0.00 % | |
| SkIdEstado | integer | 3 | 0.00 % | |
| Cantidad | numeric | 84100 | 0.00 % | × |
| Valor.Unitario | numeric | 106411 | 0.00 % | × |
| Valor.Total | numeric | 205239 | 0.00 % | × |
| Origen | character | 12 | 0.00 % | × |
| Causa | integer | 16 | 0.00 % | |
| Cantidad.Item | numeric | 14599 | 72.40 % | × |
| Descripcion.Causa | character | 16 | 0.00 % | × |
| Ajuste.Global | integer | 1 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 77 |
| Median | 100157 |
| 1st and 3rd quartiles | 10035; 100225 |
| Min. and max. | 1003; 100295 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1389 |
| Median | 1001572385 |
| 1st and 3rd quartiles | 100291683; 1002253214 |
| Min. and max. | 100346; 1002954534 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 34007 |
| Median | 10057210 |
| 1st and 3rd quartiles | 10026866; 10084837 |
| Min. and max. | 1002462; 100145631 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11238 |
| Median | 1005999 |
| 1st and 3rd quartiles | 1002291; 1008656.5 |
| Min. and max. | 100101; 10019349 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 27 |
| Median | 100140 |
| 1st and 3rd quartiles | 100140; 100370 |
| Min. and max. | 100; 100513 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2785 |
| Median | 20190521 |
| 1st and 3rd quartiles | 20151020; 20231011 |
| Min. and max. | 19000101; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2803 |
| Median | 20190524 |
| 1st and 3rd quartiles | 20151025; 20231011 |
| Min. and max. | 20110929; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “100101” |
| Reference category | 100100 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 84100 |
| Median | 0 |
| 1st and 3rd quartiles | -2; 11.58 |
| Min. and max. | -8.145313e+13; 8.145324e+13 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 106411 |
| Median | 11781 |
| 1st and 3rd quartiles | 968.96; 77350 |
| Min. and max. | -159269017478500; 170210911826087 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 205239 |
| Median | 10090.27 |
| 1st and 3rd quartiles | -308037.4; 805160.85 |
| Min. and max. | -521909894612433; 521909894612433 |
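The three numeric tables above share the same pattern: quartiles in the thousands or hundreds of thousands, with minima and maxima on the order of 1e14. A plain Tukey fence is one simple way to list such values for review. This is a sketch under that assumption and is not necessarily the rule dataMaid applies in its own outlier check; `flag_outliers` is a hypothetical helper name.

```r
# Hypothetical helper: flag values outside the Tukey fences
# [Q1 - k * IQR, Q3 + k * IQR]; k = 1.5 is the conventional default.
flag_outliers <- function(x, k = 1.5) {
  q <- quantile(x, c(0.25, 0.75), na.rm = TRUE, names = FALSE)
  spread <- q[2] - q[1]
  !is.na(x) & (x < q[1] - k * spread | x > q[2] + k * spread)
}

# Illustrative vector built from the quartiles, median and extremes
# reported in the last table above; it is not the real column.
valor_total <- c(-308037.4, 10090.27, 805160.85,
                 521909894612433, -521909894612433)
flagged <- flag_outliers(valor_total)
```

The symmetric extremes (the same magnitude with opposite signs) may be paired correction entries rather than data errors, so flagged rows are candidates for inspection, not for automatic removal.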
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "".
Note that the following levels have at most five observations: "AMP".
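Because the empty string is the mode of this variable, the reported 0 % missing rate understates how many rows lack a usable value. A minimal base R sketch of recoding such codes to `NA` follows; `recode_missing` is a hypothetical helper, and the example vector is made up apart from the level "AMP" quoted above.

```r
# Hypothetical helper: recode suspected missing-value codes to NA.
# The default code list holds only "", the value flagged in the report.
recode_missing <- function(x, codes = "") {
  x[x %in% codes] <- NA_character_
  x
}

# Made-up vector; "X" is a placeholder level, not a value from the data.
origen <- c("", "AMP", "", "X")
origen_clean <- recode_missing(origen)
missing_share <- mean(is.na(origen_clean))
```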
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 16 |
| Median | 18 |
| 1st and 3rd quartiles | 2; 40 |
| Min. and max. | 1; 63 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 199195 (72.4 %) |
| Number of unique values | 14598 |
| Median | 1 |
| 1st and 3rd quartiles | -37.61; 49.2 |
| Min. and max. | -399981840.74; 399981840.74 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 16 |
| Mode | “Actividad Terminada *” |
The following values appear with prefixed or suffixed white space: "Actualizacion de precios ", "C. Especificaciones ".
Note that the following levels have at most five observations: "C. C. Mano de Obra".
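Trailing spaces such as those in "Actualizacion de precios " can split one category into two levels. The sketch below uses base R's `trimws()` to detect and remove them. The third element is a hypothetical untrimmed twin, added only to show how trimming can merge levels.

```r
# The first two values are quoted in the report; the third is hypothetical.
descripcion <- c("Actualizacion de precios ", "C. Especificaciones ",
                 "Actualizacion de precios")

# TRUE where a value starts or ends with whitespace.
has_edge_space <- grepl("^\\s+|\\s+$", descripcion)

trimmed <- trimws(descripcion)
n_levels_before <- length(unique(descripcion))
n_levels_after <- length(unique(trimmed))
```

Comparing the number of levels before and after trimming shows whether the whitespace was hiding duplicate categories.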
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: dom nov. 02 2025 14:33:44
Report was run from directory:
D:/Estudios/Universidad/Ingenieria_estadistica/9.Noveno_semestre_Local/APRO/Programacion/consultoriaConnectBogota/EDA_review
dataMaid v1.4.2 [Pkg: 2025-04-13 from CRAN (R 4.5.2)]
R version 4.5.0 (2025-04-11 ucrt).
Platform: x86_64-w64-mingw32/x64(America/Bogota).
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 2872 |
| Number of variables | 12 |
The following variable checks were performed, depending on the data type of each variable:
| | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 35 | 0.00 % | × |
| SkIdTercero | integer | 70 | 0.00 % | × |
| SkIdFecha | integer | 385 | 0.00 % | |
| SkIdInsumo | integer | 840 | 0.00 % | × |
| SkIdBodega | integer | 33 | 0.00 % | × |
| Numero.Reintegro | integer | 1147 | 0.00 % | × |
| Remision | character | 115 | 0.00 % | × |
| Cantidad | numeric | 758 | 0.00 % | × |
| Valor.Unitario | numeric | 1381 | 0.00 % | × |
| Valor.Total | numeric | 2297 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 35 |
| Median | 100217 |
| 1st and 3rd quartiles | 100211; 100226 |
| Min. and max. | 10031; 100275 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 70 |
| Median | 11778 |
| 1st and 3rd quartiles | 11778; 11778 |
| Min. and max. | 1377; 12701 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 385 |
| Median | 20230927 |
| 1st and 3rd quartiles | 20220824; 20241203.25 |
| Min. and max. | 20190416; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 840 |
| Median | 1005212 |
| 1st and 3rd quartiles | 1001985; 10010498 |
| Min. and max. | 100101; 10018213 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 33 |
| Median | 1000217 |
| 1st and 3rd quartiles | 1000204; 1000226 |
| Min. and max. | 100; 1000275 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1147 |
| Median | 2170099 |
| 1st and 3rd quartiles | 2110002; 2260085 |
| Min. and max. | 310001; 2750002 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 115 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "", "8", "9".
The following values appear with prefixed or suffixed white space: " 2620009".
Note that the following levels have at most five observations: " 2620009", "0109", "0192", "04", "05", …, "ajuste", "NC -099", "R2930", "R3817", "salida 310" (90 values omitted).
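The "at most five observations" message corresponds to the "levels with < 6 obs." check listed earlier. A rough base R equivalent is sketched below, assuming a simple frequency count is all that is wanted. `rare_levels` is a hypothetical helper, and the counts in the example vector are invented; only the level names come from the report.

```r
# Hypothetical helper: return the levels observed at most max_n times.
rare_levels <- function(x, max_n = 5) {
  counts <- table(x, useNA = "no")
  names(counts)[counts <= max_n]
}

# Invented counts for level names that appear in the message above.
remision <- c(rep("", 10), rep("R2930", 2), "ajuste", rep("0109", 6))
rare <- rare_levels(remision)
```

For a free-text reference field such as this one, a long list of rare levels is expected, so the message says more about the field's nature than about data errors.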
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 758 |
| Median | 10 |
| 1st and 3rd quartiles | 2; 50.9 |
| Min. and max. | 0; 272825 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1381 |
| Median | 14109.6 |
| 1st and 3rd quartiles | 4636.24; 55181.49 |
| Min. and max. | 0; 688343461.35 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2297 |
| Median | 267571.5 |
| 1st and 3rd quartiles | 53550; 1171117.42 |
| Min. and max. | 0; 1331291738.96 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: dom nov. 02 2025 14:33:47
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 175811 |
| Number of variables | 18 |
The following variable checks were performed, depending on the data type of each variable:
| | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 67 | 0.00 % | |
| SkIdFechaSalida | integer | 3801 | 0.00 % | |
| SkIdInsumo | integer | 6667 | 0.00 % | × |
| SkIdTercero | numeric | 524 | 3.12 % | × |
| SkIdOrigenDelDocumento | integer | 1 | 0.00 % | × |
| SkIdEstadoPorDocumento | integer | 2 | 0.00 % | |
| SkIdItems | integer | 12573 | 0.00 % | × |
| SkIdBodega | integer | 67 | 0.00 % | |
| Salida.Numero | numeric | 56568 | 0.00 % | |
| Salida.Remision | character | 20174 | 0.00 % | × |
| Salida.Usuario | character | 65 | 0.00 % | × |
| Salida.Descuento | character | 2 | 0.00 % | |
| Salida.Cantidad | numeric | 14862 | 0.00 % | × |
| Salida.Valor.Unitario | numeric | 29829 | 0.00 % | × |
| Salida.Valor.Total | numeric | 79492 | 0.00 % | × |
| Descuentos.Cantidad | numeric | 14870 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 67 |
| Median | 100132 |
| 1st and 3rd quartiles | 10029; 100210 |
| Min. and max. | 1003; 100294 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3801 |
| Median | 20181228 |
| 1st and 3rd quartiles | 20140826; 20230512 |
| Min. and max. | 20110819; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 6667 |
| Median | 1003993 |
| 1st and 3rd quartiles | 1001404; 1008154 |
| Min. and max. | 100101; 10019220 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 5486 (3.12 %) |
| Number of unique values | 523 |
| Median | 11778 |
| 1st and 3rd quartiles | 7292; 11778 |
| Min. and max. | 22; 12701 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “100120” |
| Reference category | 100120 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 12573 |
| Median | 10054420 |
| 1st and 3rd quartiles | 10020359; 10076864 |
| Min. and max. | 100; 100144957 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 67 |
| Median | 1000132 |
| 1st and 3rd quartiles | 100029; 1000210 |
| Min. and max. | 10003; 1000294 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 56568 |
| Median | 1320073 |
| 1st and 3rd quartiles | 290527.5; 21100605 |
| Min. and max. | 0; 174000235 |
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 20174 |
| Mode | “” |
The following suspected missing value codes enter as regular values: "", " ", " ", ". 1553", ".1766", …, "888", "9", "99", "999", "9999" (4 values omitted).
The following values appear with prefixed or suffixed white space: " ", " ", " 2173", " 1256", " 0346", …, "CONTROL F ", "desc. ", "DEV ACERO ", "MAYO ", "mayo 31 " (60 values omitted).
Note that the following levels have at most five observations: " 1256", " 0346", " 957", " 0098", " 0318", …, "sc0465", "v", "vario", "Ver resume", "VS14061" (12851 values omitted).
Note that there might be case problems with the following levels: "acero", "Acero", "ACERO", "acta 4", "ACTA 4", …, "XX", "xxx", "XXX", "xxxx", "XXXX" (57 values omitted).
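Levels such as "acero", "Acero" and "ACERO" differ only in letter case and probably denote the same thing. One way to list such groups before deciding on a canonical spelling is sketched below in base R. `case_variants` is a hypothetical helper; the example reuses level names quoted in the message above.

```r
# Hypothetical helper: group distinct values that differ only by case
# and keep the groups with more than one spelling.
case_variants <- function(x) {
  lv <- unique(x[!is.na(x)])
  groups <- split(lv, tolower(lv))
  groups[lengths(groups) > 1]
}

# Level names taken from the report's messages.
remision <- c("acero", "Acero", "ACERO", "acta 4", "ACTA 4", "vario")
variants <- case_variants(remision)
```

Each element of `variants` is named by the lower-cased form and holds the spellings found, which makes it easy to review the groups before recoding.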
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 65 |
| Mode | “Julio Cesar Gomez Sanchez” |
The following values appear with prefixed or suffixed white space: "Maria Mercedes Arias ".
Note that the following levels have at most five observations: "Carlos Alfonso Maury Maury", "Diego Urrego Perez", "Erick Jose Ocon Gomez", "Nubia Andrea Lara Palma".
| Feature | Result |
|---|---|
| Variable type | character |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2 |
| Mode | “NO” |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 14862 |
| Median | 6 |
| 1st and 3rd quartiles | 2; 40 |
| Min. and max. | 0; 2973948.8 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 29829 |
| Median | 10015.81 |
| 1st and 3rd quartiles | 3195.8; 31571.27 |
| Min. and max. | -1160; 688343461.35 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 79492 |
| Median | 111860 |
| 1st and 3rd quartiles | 19784.16; 661200 |
| Min. and max. | -638000; 14511842644.69 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 14870 |
| Median | 6 |
| 1st and 3rd quartiles | 2; 40 |
| Min. and max. | -4; 2973948.8 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: dom nov. 02 2025 14:33:56
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 608 |
| Number of variables | 15 |
The following variable checks were performed, depending on the data type of each variable:
| | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto.Traslado | integer | 24 | 0.00 % | |
| SkIdProyecto.Entrada | integer | 8 | 0.00 % | |
| SkIdInsumo | integer | 373 | 0.00 % | × |
| SkIdFecha | integer | 81 | 0.00 % | |
| SkIdEstadoPorDocumento | integer | 3 | 0.00 % | |
| Numero.Traslado | integer | 145 | 0.00 % | |
| Cantidad.Traslado | numeric | 217 | 0.00 % | × |
| Valor.Unitario.Traslado | numeric | 482 | 0.00 % | × |
| Valor.Total.Traslado | numeric | 547 | 0.00 % | × |
| Numero.Entrada.Traslado | numeric | 23 | 72.37 % | × |
| Cantidad.Entrada.Traslado | numeric | 94 | 0.00 % | × |
| Unitario.Entrada.Traslado | numeric | 138 | 0.00 % | × |
| Total.Entrada.Traslado | numeric | 165 | 0.00 % | × |
| Empresa | character | 1 | 0.00 % | × |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 24 |
| Median | 100184 |
| 1st and 3rd quartiles | 10029; 100211 |
| Min. and max. | 1005; 100241 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 8 |
| Median | 100 |
| 1st and 3rd quartiles | 100; 100217 |
| Min. and max. | 100; 100241 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 373 |
| Median | 1005025 |
| 1st and 3rd quartiles | 1002279.5; 1008405 |
| Min. and max. | 100143; 10017507 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 81 |
| Median | 20201013 |
| 1st and 3rd quartiles | 20150528; 20230713 |
| Min. and max. | 20121113; 20251029 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 3 |
| Mode | “100141” |
| Reference category | 100140 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 145 |
| Median | 212 |
| 1st and 3rd quartiles | 90.75; 237 |
| Min. and max. | 3; 250 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 217 |
| Median | 18 |
| 1st and 3rd quartiles | 4; 148.5 |
| Min. and max. | 1; 32825 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 482 |
| Median | 8731.95 |
| 1st and 3rd quartiles | 2783; 29206.17 |
| Min. and max. | 20; 3153947.44 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 547 |
| Median | 233468.47 |
| 1st and 3rd quartiles | 52092.28; 1329719.03 |
| Min. and max. | 1013.63; 118986074.62 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 440 (72.37 %) |
| Number of unique values | 22 |
| Median | 209 |
| 1st and 3rd quartiles | 203; 209 |
| Min. and max. | 189; 212 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 94 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 1 |
| Min. and max. | 0; 26312.71 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 138 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 3289.76 |
| Min. and max. | 0; 1195340.72 |
| Feature | Result |
|---|---|
| Variable type | numeric |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 165 |
| Median | 0 |
| 1st and 3rd quartiles | 0; 25971.75 |
| Min. and max. | 0; 118986074.62 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: dom nov. 02 2025 14:34:34
Function call:
makeDataReport(data = df, output = "html", file = paste0("20251102/reporte_", nombre, ".html"), replace = T, openResult = F)
The dataset examined has the following dimensions:
| Feature | Result |
|---|---|
| Number of observations | 275143 |
| Number of variables | 19 |
The following variable checks were performed, depending on the data type of each variable:
| | character | factor | labelled | haven labelled | numeric | integer | logical | Date |
|---|---|---|---|---|---|---|---|---|
| Identify miscoded missing values | × | × | × | × | × | × | | × |
| Identify prefixed and suffixed whitespace | × | × | × | × | | | | |
| Identify levels with < 6 obs. | × | × | × | × | | | | |
| Identify case issues | × | × | × | × | | | | |
| Identify misclassified numeric or integer variables | × | × | × | × | | | | |
| Identify outliers | | | | | × | × | | × |
Please note that all numerical values in the following have been rounded to 2 decimals.
| Variable | Variable class | # unique values | Missing observations | Any problems? |
|---|---|---|---|---|
| SkIdEmpresa | integer | 1 | 0.00 % | × |
| SkIdProyecto | integer | 77 | 0.00 % | |
| SkIdCapitulo | integer | 1389 | 0.00 % | |
| SkIdItems | integer | 34007 | 0.00 % | × |
| SkIdInsumo | integer | 11238 | 0.00 % | × |
| SkIdReforma | logical | 1 | 100.00 % | × |
| SkIdUsuario | integer | 27 | 0.00 % | × |
| SkIdFecha | integer | 2785 | 0.00 % | × |
| SkIdFecha.Real | integer | 2803 | 0.00 % | |
| SkIdEstado | integer | 3 | 0.00 % | |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 77 |
| Median | 100157 |
| 1st and 3rd quartiles | 10035; 100225 |
| Min. and max. | 1003; 100295 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 1389 |
| Median | 1001572385 |
| 1st and 3rd quartiles | 100291683; 1002253214 |
| Min. and max. | 100346; 1002954534 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 34007 |
| Median | 10057210 |
| 1st and 3rd quartiles | 10026866; 10084837 |
| Min. and max. | 1002462; 100145631 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 11238 |
| Median | 1005999 |
| 1st and 3rd quartiles | 1002291; 1008656.5 |
| Min. and max. | 100101; 10019349 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 27 |
| Median | 100140 |
| 1st and 3rd quartiles | 100140; 100370 |
| Min. and max. | 100; 100513 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2785 |
| Median | 20190521 |
| 1st and 3rd quartiles | 20151020; 20231011 |
| Min. and max. | 19000101; 20251031 |
| Feature | Result |
|---|---|
| Variable type | integer |
| Number of missing obs. | 0 (0 %) |
| Number of unique values | 2803 |
| Median | 20190524 |
| 1st and 3rd quartiles | 20151025; 20231011 |
| Min. and max. | 20110929; 20251031 |
Report generation information:
Created by: RamiroSeb (username: SEBASTIAN).
Report creation time: dom nov. 02 2025 14:12:48
Function call:
makeDataReport(data = df, output = "html", file = "20251102/reporte_eda_Proyeccion", replace = TRUE, openResult = FALSE)